GitHub - joaonadkarni/ecco at b980feebab98a5410fd6417faeced1570b9c869b

Branches Tags

Name		Name	Last commit message	Last commit date
Latest commit History 257 Commits
.github/workflows		.github/workflows
docs		docs
notebooks		notebooks
src/ecco		src/ecco
tests		tests
.bumpversion.cfg		.bumpversion.cfg
.cookiecutterrc		.cookiecutterrc
.coveragerc		.coveragerc
.gitattributes		.gitattributes
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
.readthedocs.yml		.readthedocs.yml
AUTHORS.rst		AUTHORS.rst
CHANGELOG.rst		CHANGELOG.rst
CONTRIBUTING.rst		CONTRIBUTING.rst
LICENSE		LICENSE
MANIFEST.in		MANIFEST.in
README.rst		README.rst
mkdocs.yml		mkdocs.yml
readme.md		readme.md
requirements.txt		requirements.txt
setup.cfg		setup.cfg
setup.py		setup.py

Repository files navigation

Ecco is a python library for exploring and explaining Natural Language Processing models using interactive visualizations.

Ecco provides multiple interfaces to aid the explanation and intuition of Transformer-based language models. Read: Interfaces for Explaining Transformer Language Models.

Ecco runs inside Jupyter notebooks. It is built on top of pytorch and transformers.

Ecco is not concerned with training or fine-tuning models. Only exploring and understanding existing pre-trained models. The library is currently an alpha release of a research project. You're welcome to contribute to make it better!

Documentation: ecco.readthedocs.io

Examples:

What is the sentiment of this review?

Use a large language model (T5 in this case) to detect text sentiment. In addition to the sentiment, see the tokens the model broke the text into (which can help debug some edge cases).

Which words in this review lead the model to classify its sentiment as "negative"?

Feature attribution using Integrated Gradients helps you explore model decisions. In this case, switching "weakness" to "inclination" allows the model to correctly switch the prediction to positive.

Explore the world-knowedge of GPT models by posing fill-in-the blank questions.

Does GPT2 know where Heathrow Airport is?

What other cities/words did the model consider in addition to London?

Visuals the candidate output tokens and their probability scores.

Which input words lead it to think of London?

At which layers did the model gather confidence that London is the right answer?

The model chose London by making the highest probability token (ranking it #1) after the last layer in the model. How much did each layer contribute to increasing the ranking of London? This is a logit lens visualizations that helps explore the activity of different model layers.

What are the patterns in BERT neuron activation when ir processes a piece of text?

A group of neurons in BERT tend to fire in response to commas and other punctuation. Other groups of neurons tend to fire in response to pronouns. Use this visualization to factorize neuron activity in individual FFNN layers or in the entire model.

Read the paper:

Ecco: An Open Source Library for the Explainability of Transformer Language Models Association for Computational Linguistics (ACL) System Demonstrations, 2021

Tutorials

Video: Take A Look Inside Language Models With Ecco. [Colab Notebook]

How-to Guides

API Reference

The API reference and the architecture page explain Ecco's components and how they work together.

Gallery & Examples

Predicted Tokens: View the model's prediction for the next token (with probability scores). See how the predictions evolved through the model's layers. [Notebook] [Colab]

Rankings across layers: After the model picks an output token, Look back at how each layer ranked that token. [Notebook] [Colab]

Layer Predictions:Compare the rankings of multiple tokens as candidates for a certain position in the sequence. [Notebook] [Colab]

Primary Attributions: How much did each input token contribute to producing the output token? [Notebook] [Colab]

Detailed Primary Attributions: See more precise input attributions values using the detailed view. [Notebook] [Colab]

Neuron Activation Analysis: Examine underlying patterns in neuron activations using non-negative matrix factorization. [Notebook] [Colab]

Getting Help

Having trouble?

The Discussion board might have some relevant information. If not, you can post your questions there.
Report bugs at Ecco's issue tracker

Bibtex for citations:

@inproceedings{alammar-2021-ecco,
    title = "Ecco: An Open Source Library for the Explainability of Transformer Language Models",
    author = "Alammar, J",
    booktitle = "Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing: System Demonstrations",
    year = "2021",
    publisher = "Association for Computational Linguistics",
}

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Examples:

What is the sentiment of this review?

Which words in this review lead the model to classify its sentiment as "negative"?

Explore the world-knowedge of GPT models by posing fill-in-the blank questions.

What other cities/words did the model consider in addition to London?

Which input words lead it to think of London?

At which layers did the model gather confidence that London is the right answer?

What are the patterns in BERT neuron activation when ir processes a piece of text?

Tutorials

How-to Guides

API Reference

Gallery & Examples

Getting Help

About

Releases

Packages

Languages

License

joaonadkarni/ecco

Folders and files

Latest commit

History

Repository files navigation

Examples:

What is the sentiment of this review?

Which words in this review lead the model to classify its sentiment as "negative"?

Explore the world-knowedge of GPT models by posing fill-in-the blank questions.

What other cities/words did the model consider in addition to London?

Which input words lead it to think of London?

At which layers did the model gather confidence that London is the right answer?

What are the patterns in BERT neuron activation when ir processes a piece of text?

Tutorials

How-to Guides

API Reference

Gallery & Examples

Getting Help

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages