An agentic framework for rational inverse design of materials by replicating and incorporating the expertise of human domain experts across various design tasks and target properties.
The model starts by inputting the material's initial structure as a textual representation. The AI agent dynamically retrieves design guidelines for Property X from scientific literature, the Internet or other resources. Based on these domain-knowledge of guidelines, and any additional design constraints provided in natural language, the agent proposes a new candidate and assesses its chemical feasibility in real-time. Next, it estimates Property X for the new candidate, incorporating epistemic uncertainty, using a cost-efficient surrogate model. Optionally, as part of a human-in-the-loop process, the human chemist can review the agent's new candidates and chain-of-thoughts, providing feedback and suggesting further modifications or constraints, creating an opportunity for human-AI collaboration to guide the exploration process. The agent continues exploring the chemical space, guided by chemistry-informed rules, until it meets the convergence criteria.Collaborative efforts between a human expert and AI agents hold significant promise. In the case of molecular design for WDR5 ligands, we examined human guidance to refine the modifications based on docking scores and structural generation.
dziner_human-in-the-loop-animation.mp4
We applied dZiner to the rational inverse design of likely synthesizable organic linkers for metal-organic frameworks with high CO2 adsorption capacity at 0.5 bar of pressure. These MOFs come with pcu topology and three types of inorganic nodes: Cu paddlewheel, Zn paddlewheel, and Zn tetramer (three most frequent node-topology pairs in the hMOF dataset). Design constraints such as keeping molecular weight lower than 600 g/mol and excluding certain potentially unstable functional groups (nitrosylated, chloro-, fluoro- amines) are simply added to the model as natural language text.
dZiner can work with different textual representation for materials. You can even apply your own surrogate model to your own materials inverse design problem. Here are some example notebooks that can help you get started:
- Quick Start
- Example 1 - MOF Organic Linker Design for CO2 Adsorption
- Example 2 - Desigining Peptides with a Lower Hemolytic Activity
- Example 3 - Low CMC Surfactant Design
- Example 4 - Different Choices of Surrorgate Models
- Example 5 - WDR5 Ligand Inverse Design
- Example 6 - Human-in-the-loop design of more soluble molecules with IBM's embedding model (smi-TED)
For more details checkout preprint and if you are using our methodology, please consider citing us using the citation below:
@misc{ansari2024dzinerrationalinversedesign,
title={dZiner: Rational Inverse Design of Materials with AI Agents},
author={Mehrad Ansari and Jeffrey Watchorn and Carla E. Brown and Joseph S. Brown},
year={2024},
eprint={2410.03963},
archivePrefix={arXiv},
primaryClass={physics.chem-ph},
url={https://arxiv.org/abs/2410.03963},
}
Note that with GPU installation, the wall-time of the domain-knowledge tool that uses RAG becomes much shorter. After this installation, you wiil need to install the surrogate model you want to use (see below).
pip install dziner
conda install -c pytorch/label/nightly -c nvidia faiss-gpu=1.8.0
python -m ipykernel install --user --name dziner --display-name "dziner"
pip install dziner
conda install -c pytorch/label/nightly faiss-cpu
python -m ipykernel install --user --name dziner --display-name "dziner"
You can clone the source code and install in developer mode:
conda create -n dziner python=3.11.9
conda activate dziner
git clone https://github.com/mehradans92/dziner.git && cd dziner
pip install -e .
conda install -c pytorch/label/nightly -c nvidia faiss-gpu=1.8.0
python -m ipykernel install --user --name dziner --display-name "dziner"
If you do not have a gpu in your machine (OSX for example) you will need to execute the following instead:
conda create -n dziner python=3.11.9
conda activate dziner
git clone https://github.com/mehradans92/dziner.git && cd dziner
pip install -e .
conda install -c pytorch/label/nightly faiss-cpu
python -m ipykernel install --user --name dziner --display-name "dziner"
Make sure you follow additional installation of packages for different surrogate models. Requirements can be found in each folder.
cd dziner/surrogates/YOUR_MODEL_OF_INTEREST
pip install -r requirements.txt
If you are using closed-source LLMs from OpenAI or Anthropic, you will need to have a valid OPENAI_API_KEY
or ANTHROPIC_API_KEY
. If you are on a windows machine you can add keys in your Environment Variables. For linux systems set the key by adding this line to ~/.bashrc
:
export OPENAI_API_KEY=your_openai_api_key
export ANTHROPIC_API_KEY=your_anthropic_api_key
If you are using the jupyter notebook examples in this repo you may need to restart your kernel after adding the key to your environment