
Code implementation for our paper: Automatic benchmarking of large multimodal models via iterative experiment programming


Automatic benchmarking of large multimodal models via iterative experiment programming

Setup

Install dependencies

# clone project
git clone https://github.com/altndrr/apex
cd apex

# install requirements
# it will create a .venv folder in the project root
# and install all the dependencies using flit
make install

# activate virtual environment
source .venv/bin/activate

Setup environment variables

# copy .env.example to .env
cp .env.example .env

# edit .env file
vim .env
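A .env file holds simple KEY=value pairs that are loaded into the environment at startup. As an illustration only, here is a minimal stdlib sketch of how such a file is parsed (the variable names below are hypothetical; the real keys are listed in .env.example):

```python
import os
import tempfile

# Hypothetical example contents; the actual keys live in .env.example
ENV_EXAMPLE = """\
# API credentials (hypothetical names, for illustration only)
OPENAI_API_KEY=your-key-here
HF_HOME=/data/huggingface
"""

def parse_dotenv(path):
    """Parse KEY=value lines, skipping comments and blank lines."""
    values = {}
    with open(path) as f:
        for line in f:
            line = line.strip()
            if not line or line.startswith("#"):
                continue
            key, _, value = line.partition("=")
            values[key.strip()] = value.strip()
    return values

# Write the example to a temporary file and load it
with tempfile.NamedTemporaryFile("w", suffix=".env", delete=False) as f:
    f.write(ENV_EXAMPLE)
    path = f.name

env = parse_dotenv(path)
os.environ.update(env)
print(sorted(env))
```

In practice a library such as python-dotenv usually handles this; the sketch above only shows the file format the project expects you to fill in.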

Usage

The only entry point is main.py. It must be called with the query argument, which is the question to ask the model.

# ask the model a question
python main.py query="Can models ... ?"

Note: the first run will take a while, as it will download the necessary models and datasets.

Configuration

The full list of parameters can be found under configs; the most important file is:

  • main.yaml: main configuration file for the entry point.

Parameters can be overridden by passing them as command-line arguments. The ++ prefix additionally lets you set any parameter from the config file, even one not already declared.

# limit the number of experiments to 3
python main.py query="Can models ... ?" ++max_experiments=3
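The key=value and ++key=value syntax matches Hydra's override grammar, which the project appears to use for configuration (an inference from the command line above, not something stated in this README). As an illustration only, a minimal stdlib sketch of the add-or-override semantics, with hypothetical keys and the grammar heavily simplified:

```python
def apply_override(config, arg):
    """Apply a single Hydra-style override such as 'key=value' or '++key=value'.

    A plain 'key=value' may only touch keys already in the config;
    '++key=value' adds the key if it is missing (simplified semantics).
    """
    force = arg.startswith("++")
    key, _, value = arg.lstrip("+").partition("=")
    if key not in config and not force:
        raise KeyError(f"unknown key {key!r}; use ++{key}= to add it")
    config[key] = value
    return config

# Hypothetical config; the real defaults come from configs/main.yaml
cfg = {"query": None}
apply_override(cfg, "query=Can models ... ?")
apply_override(cfg, "++max_experiments=3")  # ++ adds the key if missing
print(cfg)
```

Real Hydra also distinguishes +key (add only) from ++key (add or override) and parses typed values; the sketch only conveys why the ++ prefix is needed for parameters absent from main.yaml.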

Development

Install pre-commit hooks

# install pre-commit hooks
pre-commit install

Run tests

# run fast tests
make test

# run all tests
make test-full

Format code

# run linters
make format

Clean repository

# remove autogenerated files
make clean

# remove logs
make clean-logs
