Skip to article frontmatterSkip to article content
Site not loading correctly?

This may be due to an incorrect BASE_URL configuration. See the MyST Documentation for reference.

ODAC25

Dataset Overview
PropertyValue
Size~70 million DFT calculations
DomainMetal-organic frameworks (MOFs)
AdsorbatesCO2, H2O, N2, O2
MOFs~15,000 structures
LabelsTotal energy (eV), forces (eV/A)
Level of TheoryPBE+D3 (VASP)
LicenseCC BY 4.0
DownloadHuggingFace

The Open DAC 2025 (ODAC25) dataset contains nearly 70 million DFT single-point calculations for CO2, H2O, N2, and O2 adsorption in nearly 15,000 metal-organic frameworks (MOFs). This dataset represents a significant expansion upon ODAC23, introducing greater chemical and configurational diversity through functionalized MOFs, high-energy GCMC-derived placements, and synthetically generated frameworks.

The dataset enables training of state-of-the-art machine-learned interatomic potentials for direct air capture applications. All structures are labeled with total energies (eV) and forces (eV/Å) computed using VASP with the PBE+D3 functional.

All information about the dataset is available at the ODAC25 Huggingface site. For questions or issues, please open a GitHub issue in this repository.

Dataset format

The dataset is provided in ASE DB compatible lmdb files (*.aselmdb).

Citing ODAC25

The ODAC25 dataset is licensed under a Creative Commons Attribution 4.0 License.

Please consider citing the following paper in any publications that uses this dataset:

@misc{sriram2025odac25,
    title={The Open DAC 2025 Dataset for Sorbent Discovery in Direct Air Capture}, 
    author={Anuroop Sriram and Logan M. Brabson and Xiaohan Yu and Sihoon Choi and Kareem Abdelmaqsoud and Elias Moubarak and Pim de Haan and Sindy Löwe and Johann Brehmer and John R. Kitchin and Max Welling and C. Lawrence Zitnick and Zachary Ulissi and Andrew J. Medford and David S. Sholl},
    year={2025},
    eprint={},
    archivePrefix={arXiv},
    primaryClass={},
    url={}, 
}