- Support for Unicode (multilingual) grammars (2024-02-29)
- Integration with Text-Generation-WebUI (2023-12-17)
We are thrilled to announce that transformers_cfg
has been used in the Text-Generation-WebUI project.
This integration enables users to utilize our CFG capabilities within the popular, 30.5K-starred web interface for text generation.
For more details, see the relevant Pull Request.
transformers_cfg
is an extension library for the popular Transformers library by Hugging Face, tailored for working with context-free grammars (CFG).
This package provides additional tools and functionalities to enhance your experience with natural language processing tasks involving CFGs.
It was initially developed as a pull request to the Hugging Face Transformers library. See relevant discussion here.
pip install transformers-cfg
The example below can be found in examples/generate_json.py
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer
from transformers_cfg.grammar_utils import IncrementalGrammarConstraint
from transformers_cfg.generation.logits_process import GrammarConstrainedLogitsProcessor
if __name__ == "__main__":
    # Detect if GPU is available, otherwise use CPU
    device = torch.device("cuda" if torch.cuda.is_available() else "cpu")
    print(f"Using device: {device}")

    model_id = "mistralai/Mistral-7B-v0.1"

    # Load model and tokenizer
    tokenizer = AutoTokenizer.from_pretrained(model_id)
    tokenizer.pad_token = tokenizer.eos_token
    model = AutoModelForCausalLM.from_pretrained(model_id).to(device)  # Load model onto the selected device
    model.generation_config.pad_token_id = model.generation_config.eos_token_id

    # Load JSON grammar
    with open("examples/grammars/json.ebnf", "r") as file:
        grammar_str = file.read()
    grammar = IncrementalGrammarConstraint(grammar_str, "root", tokenizer)
    grammar_processor = GrammarConstrainedLogitsProcessor(grammar)

    # Generate
    prefix1 = "This is a valid json string for http request:"
    prefix2 = "This is a valid json string for shopping cart:"
    input_ids = tokenizer(
        [prefix1, prefix2], add_special_tokens=False, return_tensors="pt", padding=True
    )["input_ids"].to(device)  # Move inputs to the same device as the model

    output = model.generate(
        input_ids,
        max_length=50,
        logits_processor=[grammar_processor],
        repetition_penalty=1.1,
        num_return_sequences=1,
    )
    # Decode output
    generations = tokenizer.batch_decode(output, skip_special_tokens=True)
    print(generations)
"""
'This is a valid json string for http request:{ "request": { "method": "GET", "headers": [], "content": "Content","type": "application" }}
'This is a valid json string for shopping cart:{ "name": "MyCart", "price": 0, "value": 1 }
"""
- We support the EBNF grammar description format.
- We offer the same grammar interface as the llama-cpp project, allowing you to drop in transformers-CFG as a replacement for llama-cpp.
- We allow you to use any model in the 🤗 Transformers library, including those not supported by llama-cpp.
- We support multilingual grammars: you can use characters from any language in your grammar, e.g. 中文, 日本語, 한국어, हिन्दी, العربية, עברית, or emoji 🤗.
TL;DR: Think of it as an enhanced version of regular expressions.
Here is an example of a simplified JSON grammar:
# A JSON object is the root of the grammar
root ::= object
# An object starts with "{" and ends with "}" and contains pairs separated by ","
object ::= "{" pair ("," pair)* "}"
# A pair is a string followed by a ":" and a value
pair ::= string ":" value
# A string is a sequence of alphanumeric characters enclosed in double quotes
string ::= '"' [a-zA-Z0-9]* '"'
# A value can be a string, another object, or a boolean value
value ::= string | object | "true" | "false" | "null"
This grammar describes the structure of a JSON object. It specifies that a JSON object is a brace-enclosed collection of key-value pairs, where each key is a string and each value can be a string, another object, a boolean, or null.
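For instance, the grammar above accepts strings such as the following (note that this simplified grammar allows no whitespace between tokens):

{"name":"Alice","active":true,"address":{"city":"Paris"}}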
Grammars don't need to be complicated. You can use them to describe very simple but useful patterns, such as a valid email address, a valid URL, or a phone number:
phone_number ::= "+" [0-9]+
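A one-line grammar like this can also be passed as an inline string; here is a minimal sketch reusing the model, tokenizer, device, and imports from the quickstart above (the second argument of IncrementalGrammarConstraint names the start rule, here phone_number instead of root):

# Sketch: constrain generation with an inline grammar string.
# Assumes model, tokenizer, and device from the quickstart example.
phone_grammar = 'phone_number ::= "+" [0-9]+'
grammar = IncrementalGrammarConstraint(phone_grammar, "phone_number", tokenizer)
grammar_processor = GrammarConstrainedLogitsProcessor(grammar)

prompt = "The phone number in this example is "
input_ids = tokenizer([prompt], add_special_tokens=False, return_tensors="pt")["input_ids"].to(device)
output = model.generate(input_ids, max_length=30, logits_processor=[grammar_processor])
print(tokenizer.batch_decode(output, skip_special_tokens=True))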
You can also force the model to generate only emojis, or only Korean characters:
['Describe your feeling with emoji: <emoji-only continuation>', 'Write a poem with emoji: <emoji-only continuation>']
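A grammar for that could look like the following sketch; the character-range syntax mirrors the llama-cpp GBNF convention, and the specific Unicode ranges here are illustrative assumptions, not the project's shipped grammars:

# Illustrative: restrict output to one emoji block (Emoticons, U+1F600-U+1F64F)
root ::= [😀-🙏]+

# Illustrative: restrict output to Hangul syllables instead
# root ::= [가-힣]+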
More details can be found in this doc from the llama-cpp project. An advanced grammar debugging guide can be found here.
Here is an awesome tool to generate grammars for you: Grammar Builder
We provide a collection of grammars in the examples/grammars
folder, mostly identical to the grammars in the llama-cpp project.
We try to keep them in sync with the original llama-cpp grammars,
but we cannot yet guarantee that every grammar from the llama-cpp project works directly in transformers-CFG.
The collection includes the following grammars (a sketch for switching between them follows the list):
- json.ebnf: A grammar for generating valid json objects.
- json_arr.ebnf: A grammar for generating valid json arrays.
- c.ebnf: A grammar for generating valid C programs.
- chess.ebnf: A grammar for generating valid chess moves.
- arithmetic.ebnf: A grammar for generating valid arithmetic expressions.
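Switching between these grammars only changes the file you load; a minimal sketch, reusing the tokenizer from the quickstart and assuming json_arr.ebnf also names its start rule "root":

# Load the JSON-array grammar instead of the JSON-object grammar.
with open("examples/grammars/json_arr.ebnf", "r") as file:
    grammar_str = file.read()
grammar = IncrementalGrammarConstraint(grammar_str, "root", tokenizer)
grammar_processor = GrammarConstrainedLogitsProcessor(grammar)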
Supported model families include:
- LLaMa family models
- GPT family models
- Bloom family models
- Mistral family models
- Falcon family models
- ...
See supported_models.yaml for the full list of supported models.
As a rule of thumb, any model that shares its tokenizer with a supported model should work as well. If you find a model that is not supported, please open an issue or submit a pull request.
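One quick, unofficial way to probe whether a new model's tokenizer works is to try building the constraint for it. This is a heuristic sketch, and the model id below is a placeholder:

from transformers import AutoTokenizer
from transformers_cfg.grammar_utils import IncrementalGrammarConstraint

# Placeholder model id; substitute the model you want to test.
tokenizer = AutoTokenizer.from_pretrained("your-org/your-model")
with open("examples/grammars/json.ebnf", "r") as file:
    grammar_str = file.read()
try:
    IncrementalGrammarConstraint(grammar_str, "root", tokenizer)
    print("Constraint built; the tokenizer is likely supported.")
except Exception as err:
    print(f"Constraint failed to build; the tokenizer may be unsupported: {err}")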
Please consider citing our work if you find the provided resources useful.
@inproceedings{geng-etal-2023-grammar,
title = {Grammar-Constrained Decoding for Structured {NLP} Tasks without Finetuning},
author = {Geng, Saibo and Josifoski, Martin and Peyrard, Maxime and West, Robert},
year = 2023,
month = dec,
booktitle = {Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing},
publisher = {Association for Computational Linguistics},
address = {Singapore},
url = {https://aclanthology.org/2023.emnlp-main.674},
editor = {Bouamor, Houda and Pino, Juan and Bali, Kalika}
}
This project is licensed under the MIT License.
This project is derived from the torch-grammars project, which was derived from the llama-cpp project.