🥞🦜 askFSDL 🦜🥞

askFSDL is a demonstration of a retrieval-augmented question-answering application.

You can try it out via the Discord bot frontend in the Full Stack Discord!

We use our educational materials as a corpus: the Full Stack LLM Bootcamp, the Full Stack Deep Learning course, and the Opinionated LLM++ Lit Review.

As a result, the application is great at answering questions like:

Which is cheaper: running experiments on cheap, slower GPUs or fast, more expensive GPUs?
How do I build an ML team?
What's a data flywheel?
Should I use a dedicated vector store for my embeddings?
What is zero-shot chain-of-thought reasoning?

Stack

We use langchain to organize our LLM invocations and prompt magic.
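As a rough illustration of the "prompt magic" involved, here is a minimal sketch of stuffing retrieved sources into a question-answering prompt. It is not the contents of `prompts.py` and it does not use langchain: the `PROMPT` wording and the `build_prompt` helper are hypothetical, and the sketch uses only the standard library.

```python
from string import Template

# Hypothetical prompt wording; the project's real prompts may differ.
PROMPT = Template(
    "Answer the question using only the sources below.\n\n"
    "Sources:\n$sources\n\n"
    "Question: $question\n"
    "Answer:"
)


def build_prompt(question: str, sources: list[str]) -> str:
    """Number each retrieved source and substitute it into the template."""
    numbered = "\n".join(f"[{i}] {s}" for i, s in enumerate(sources, start=1))
    return PROMPT.substitute(sources=numbered, question=question)


if __name__ == "__main__":
    print(build_prompt(
        "What's a data flywheel?",
        ["A data flywheel uses user feedback to improve the model."],
    ))
```

Numbering the sources makes it possible to ask the model to cite which one it relied on.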

We stood up a MongoDB instance on Atlas to store our cleaned and organized document corpus. See the Running ETL to Build the Document Corpus notebook for details.

For fast search of relevant documents to insert into our prompt, we use a FAISS index.
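To make the retrieval step concrete, here is a minimal sketch of top-k similarity search, the operation a vector index performs. It is not the code in `vecstore.py` and it does not use FAISS: the toy `embed` function (word counts), the `cosine` and `top_k` helpers, and the sample `DOCS` are all hypothetical, chosen so the example runs on the standard library alone.

```python
import math
from collections import Counter


def embed(text: str) -> Counter:
    """Toy embedding (assumption): lowercase bag-of-words counts."""
    return Counter(text.lower().split())


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[w] * b[w] for w in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def top_k(query: str, docs: list[str], k: int = 2) -> list[tuple[float, str]]:
    """Score every document against the query and keep the k best."""
    q = embed(query)
    scored = [(cosine(q, embed(d)), d) for d in docs]
    return sorted(scored, key=lambda pair: pair[0], reverse=True)[:k]


# Hypothetical mini-corpus standing in for the real document store.
DOCS = [
    "a data flywheel uses user feedback to improve the model",
    "cheap gpus can be slower but cost less per experiment",
    "a vector store holds embeddings for similarity search",
]

if __name__ == "__main__":
    for score, doc in top_k("what is a data flywheel", DOCS, k=1):
        print(f"{score:.2f}  {doc}")
```

This brute-force scan compares the query against every document. A dedicated index such as FAISS serves the same purpose with learned embeddings and data structures built for speed at scale.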

We host the application backend on Modal, which provides serverless execution and scaling. That's also where we execute batch jobs, like writing to the document store and refreshing the vector index.

For creating a simple user interface in pure Python, we use Gradio. This UI is great for quick tests without deploying a full frontend but with a better developer experience than curl-ing from the command line.

We host the Discord bot, written in discord.py, on a free-tier AWS EC2 instance, which we provision and configure with Pulumi.

We use Gantry to monitor model behavior in production and collect feedback from users.

Run it yourself

Thanks to community contributions, we can share a best-effort guide to running the application yourself here.
