Pr🥨mptzl

Turn state-of-the-art LLMs into zero⁺-shot PyTorch classifiers in just a few lines of code.

Promptzl offers:

🤖 Zero⁺-shot classification with LLMs
🤗 Turning causal and masked LMs into classifiers without any training
📦 Batch processing on your device for efficiency
🚀 Speed-up over calling an online API
🔎 Transparency and accessibility by using the model locally
📈 Distribution over labels
✂️ No need to extract the predictions from the answer.

For more information, check out the official documentation.

Installation

pip install -U promptzl

Getting Started

In just a few lines of code, you can transform a LLM of choice into an old-school classifier with all it's desirable properties:

Set up the dataset:

from datasets import Dataset

dataset = Dataset.from_dict(
    {
        'text': [
            "The food was absolutely wonderful, from preparation to presentation, very pleasing.",
            "The service was a bit slow, but the food made up for it. Highly recommend the pasta!",
            "The restaurant was too noisy and the food was mediocre at best. Not worth the price.",
        ],
        'label': [1, 1, 0]
    }
)

Define a prompt for guiding the language model to the correct predictions:

from promptzl import FnVbzPair, Vbz
prompt = FnVbzPair(
    lambda e: f"""Restaurant review classification into categories 'positive' or 'negative'.

    'Best pretzls in town!'='positive'
    'Rude staff, horrible food.'='negative'

    '{e['text']}'=""",
    Vbz({0: ["negative"], 1: ["positive"]}))

Initialize a model:

from promptzl import CausalLM4Classification
model = CausalLM4Classification(
    'HuggingFaceTB/SmolLM2-1.7B',
    prompt=prompt)

Classify the data:

from sklearn.metrics import accuracy_score
output = model.classify(dataset, show_progress_bar=True, batch_size=1)
accuracy_score(dataset['label'], output.predictions)
1.0

For more detailed tutorials, check out the documentation!

Name	Name	Last commit message	Last commit date
Latest commit LazerLambda Merge pull request #94 from LazerLambda/docu Jan 17, 2025 2cecede · Jan 17, 2025 History 312 Commits
.github/workflows	.github/workflows	Update workflow	Nov 18, 2024
docs	docs	[docs] Add benchmark table	Jan 17, 2025
promptzl	promptzl	Update version number	Dec 22, 2024
tests	tests	Update names and docu	Dec 22, 2024
.gitattributes	.gitattributes	Add .gitattributes	Sep 2, 2024
.gitignore	.gitignore	Update .gitignore	Dec 18, 2024
.readthedocs.yaml	.readthedocs.yaml	Update .readthedocs.yaml	Nov 18, 2024
LICENSE.md	LICENSE.md	Resolve all TODOs, update LICENSE to Apache 2.0, add numpy<2.0.0 bound	Nov 30, 2024
README.md	README.md	[docs] Update README.md	Dec 26, 2024
RELEASES.md	RELEASES.md	Update release message	Dec 22, 2024
pyproject.toml	pyproject.toml	Add sentencepiece for tokenizer compatibility	Dec 15, 2024
test-requirements.txt	test-requirements.txt	Update class names, update docs, update requirements	Nov 20, 2024
tox.ini	tox.ini	Update docstrings and autodocs, remove superfluous arguments	Dec 18, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Pr🥨mptzl

Installation

Getting Started

About

Releases 3

Packages

Languages

License

LazerLambda/Promptzl

Folders and files

Latest commit

History

Repository files navigation

Pr🥨mptzl

Installation

Getting Started

About

Topics

Resources

License

Stars

Watchers

Forks

Releases 3

Packages 0

Languages

Packages