Skip to content

Datasets and Dataset Collections

Below is a collection of free / open datasets and lists of datasets at the interface between chemistry, materials and machine learning / AI.

Foundations

  • Hugging Face

    Ecosystem of pretrained models, datasets, and Python libraries for transformers, diffusion models, and other modern ML architectures.

    License: -

    transformersLLMsmodel hub

Chemistry

Materials