MLIP Arena is an open, community-driven benchmark platform. Contributions of new models, benchmarks, bug fixes, and improvements are welcome.

Two main ways to contribute

Add a new model

Register a pretrained MLIP so it participates in all existing and future benchmarks automatically.

Add a new benchmark

Design a new benchmark task that tests a physical property or failure mode not yet covered.
You can also contribute bug fixes, documentation improvements, and infrastructure changes through a standard pull request.

Code quality standards

Linting

MLIP Arena uses ruff for linting and formatting. Before submitting a PR, run:
ruff check .
ruff format .
The CI pipeline enforces ruff checks on every push and pull request.
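Ruff reads its settings from `pyproject.toml` (or `ruff.toml`). The rule selection below is purely illustrative (it is not MLIP Arena's actual configuration), but it shows the shape of a typical setup:

```toml
# Illustrative ruff configuration; the specific rules selected here are
# an example, not MLIP Arena's actual settings.
[tool.ruff]
line-length = 88

[tool.ruff.lint]
select = ["E", "F", "I"]  # pycodestyle errors, pyflakes, import sorting
```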

Testing

Tests live in the tests/ directory and are run with pytest:
tests/
  test_app.py
  test_elasticity.py
  test_eos.py
  test_external_calculators.py
  test_internal_calculators.py
  test_md.py
  test_mof.py
  test_neb.py
Run tests locally before submitting:
pytest -vra -n 5 --dist=loadscope tests
All new models should pass tests/test_external_calculators.py. New benchmark tasks should include a corresponding test file.
The test suite runs across Python 3.10, 3.11, and 3.12 in CI. Make sure your code is compatible with all three versions.
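For orientation, the smoke-test pattern for a new model looks roughly like the stdlib-only sketch below. `DummyCalculator` and `run_smoke_test` are illustrative stand-ins, not the MLIP Arena API; the real tests in tests/test_external_calculators.py parametrize over the model registry with pytest and evaluate genuine atomic structures.

```python
# Illustrative smoke-test pattern for a new model (plain Python for
# clarity; the real suite uses pytest). DummyCalculator stands in for a
# registered MLIP calculator and is NOT part of the MLIP Arena API.
class DummyCalculator:
    def get_potential_energy(self, atoms=None):
        return -3.14  # a real model returns its predicted energy (eV)

    def get_forces(self, atoms=None):
        return [[0.0, 0.0, 0.0]]  # one 3-vector per atom

def run_smoke_test(calc):
    """Check that energies and forces come back with sane shapes."""
    energy = calc.get_potential_energy()
    forces = calc.get_forces()
    assert isinstance(energy, float), "energy should be a scalar float"
    assert all(len(f) == 3 for f in forces), "forces should be N x 3"
    return energy, forces

print(run_smoke_test(DummyCalculator()))
# (-3.14, [[0.0, 0.0, 0.0]])
```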

How to fork and submit a PR

1. Fork and clone the repository

Fork atomind-ai/mlip-arena on GitHub, then clone your fork:
git clone https://github.com/<your-username>/mlip-arena.git
cd mlip-arena

2. Set up the development environment

Use the provided installation script for a clean environment with uv:
# Install uv (fast pip alternative)
curl -LsSf https://astral.sh/uv/install.sh | sh
source $HOME/.local/bin/env

# One-script installation (Linux)
bash scripts/install.sh

# macOS
bash scripts/install-macosx.sh
Install in a fresh virtual environment. Several popular MLIPs have conflicting dependencies that can break each other when installed together.

3. Create a feature branch

git checkout -b feat/my-model-or-benchmark

4. Make your changes

Follow the guides for adding a model or adding a benchmark. Run linting and tests locally before proceeding.

5. Push and open a pull request

git push origin feat/my-model-or-benchmark
Open a pull request against the main branch of atomind-ai/mlip-arena. Describe what you added or changed, and link any relevant issues or papers.

GitHub Actions CI/CD pipeline

Every push and pull request triggers the following automated pipeline:
| Workflow | File | Trigger |
| --- | --- | --- |
| Python Test | test.yaml | Push and PR to main |
| Sync to Hugging Face | sync-hf.yaml | After Python Test succeeds on main |
| Release | release.yaml | Manual / tag |
Python Test (test.yaml) runs on Ubuntu across Python 3.10, 3.11, and 3.12. It installs all dependencies, logs in to Hugging Face to fetch model checkpoints, runs the full pytest suite in parallel, and performs a trial push to the Hugging Face Space to verify compatibility.

Sync to Hugging Face (sync-hf.yaml) runs only after the test workflow succeeds on main. It squashes history and force-pushes to the Hugging Face Space, so the live leaderboard reflects the latest state of the repository.
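The version matrix in test.yaml looks roughly like the fragment below. This is a reconstruction from the description above, not a copy of the actual workflow file:

```yaml
# Reconstructed sketch of the test matrix, not the real test.yaml.
jobs:
  test:
    runs-on: ubuntu-latest
    strategy:
      matrix:
        python-version: ["3.10", "3.11", "3.12"]
    steps:
      - uses: actions/checkout@v4
      - uses: actions/setup-python@v5
        with:
          python-version: ${{ matrix.python-version }}
      - run: pytest -vra -n 5 --dist=loadscope tests
```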
PRs that fail CI will not be merged. Check the Actions tab in your PR to diagnose failures.

Finding outstanding tasks

The GitHub project page tracks outstanding tasks, known issues, and planned features. If you are looking for a place to start, check issues labeled good first issue or help wanted.

Discussing new ideas

Before investing significant time in a new benchmark or major feature, open a thread in GitHub Discussions to propose the idea and get feedback from maintainers and other contributors.

License

MLIP Arena is released under the Apache-2.0 license. By contributing, you agree that your contributions will be licensed under the same terms.
Individual models registered in MLIP Arena may carry their own licenses (MIT, BSD-3-Clause, GPL-3.0, etc.). Check the license field in mlip_arena/models/registry.yaml for each model.
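For example, you could group models by license after loading the registry. The excerpt below is hand-written to mirror the registry's shape (the model names are made up, and the real file is YAML, typically loaded with PyYAML rather than written as a literal dict):

```python
# Hand-written excerpt mirroring the shape of mlip_arena/models/registry.yaml.
# Model names are fictional; the real registry is YAML (load it with PyYAML).
registry_excerpt = {
    "SomeModelA": {"license": "MIT"},
    "SomeModelB": {"license": "GPL-3.0"},
}

def models_by_license(registry):
    """Group registered model names by their declared license."""
    grouped = {}
    for name, meta in registry.items():
        grouped.setdefault(meta.get("license", "unknown"), []).append(name)
    return grouped

print(models_by_license(registry_excerpt))
# {'MIT': ['SomeModelA'], 'GPL-3.0': ['SomeModelB']}
```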