This Streamlit app provides a summary of Site Wang's learnings from GTC 2024.
Deployed at streamlit community cloud
- Switch to a new virtualenv
- Install dependencies:
pip install -r requirements.txt
- Make sure you have env var set LLM calls
OPENAI_API_KEY
for OpenAI callsAPI_KEY
for Google's Gemini
- Kick off the app by doing
streamlit run gtc_summary.py
Generally, the spending of all models below shouldn't be more than $0.1 for 1 day of extensive tuning/optimization.
- Embedding Model: OpenAI's embedding with interface from LlamaIndex
- RAG: LlamaIndex's RAG model with GPT3.5Turbo + OpenAI Embedding
- Keyword Search: OpenAI's embedding
- Company Search: Google's Gemini 1.5Pro
- Code Generation
- Blockers: GPT-3.5-Turbo
- Code Planner: GPT-3.5-Turbo
- Code Writer: Google Gemini
- Code Reviewer: GPT-4-Turbo
Please try not to hack it. Trust me you won't get much. If you are successful, I appreciate let me know. I would love to learn from you and block your hack.