LLM (Large Language Models) FineTuning Projects and notes on common practical techniques

Find me here..

🐦 TWITTER: https://twitter.com/rohanpaul_ai
🟠 YouTube: https://www.youtube.com/@RohanPaul-AI/featured
👨🏻‍💼 LINKEDIN: https://www.linkedin.com/in/rohan-paul-b27285129/
👨‍🔧 KAGGLE: https://www.kaggle.com/paulrohan2020

Fine-tuning LLM (and YouTube Video Explanations)

Notebook	🟠 YouTube Video
Finetune Llama-3-8B with unsloth 4bit quantized with ORPO
Llama-3 Finetuning on custom dataset with unsloth
CodeLLaMA-34B - Conversational Agent
Inference Yarn-Llama-2-13b-128k with KV Cache to answer quiz on very long textbook
Mistral 7B FineTuning with_PEFT and QLORA
Falcon finetuning on openassistant-guanaco
Fine Tuning Phi 1_5 with PEFT and QLoRA
Web scraping with Large Language Models (LLM)-AnthropicAI + LangChainAI

Fine-tuning LLM

Notebook	Colab
📌 Gemma_2b_finetuning_ORPO_full_precision
📌 Jamba_Finetuning_Colab-Pro
📌 Finetune codellama-34B with QLoRA
📌 Mixtral Chatbot with Gradio
📌 togetherai api to run Mixtral
📌 Integrating TogetherAI with LangChain 🦙
📌 Mistral-7B-Instruct_GPTQ - Finetune on finance-alpaca dataset 🦙
📌 Mistral 7b FineTuning with DPO Direct_Preference_Optimization
📌 Finetune llama_2_GPTQ
📌 TinyLlama with Unsloth and_RoPE_Scaling dolly-15 dataset
📌 Tinyllama fine-tuning with Taylor_Swift Song lyrics

LLM Techniques and utils - Explained

LLM Concepts
📌 DPO (Direct Preference Optimization) training and its datasets
📌 4-bit LLM Quantization with GPTQ
📌 Quantize with HF Transformers
📌 Understanding rank r in LoRA and related Matrix_Math
📌 Rotary Embeddings (RopE) is one of the Fundamental Building Blocks of LlaMA-2 Implementation
📌 Chat Templates in HuggingFace
📌 How is Mixtral 8x7B is a dense 47Bn param model
📌 The concept of validation log perplexity in LLM training - a note on fundamentals.
📌 Why we need to identify `target_layers` for LoRA/QLoRA
📌 Evaluate Token per sec
📌 traversing through nested attributes (or sub-modules) of a PyTorch module
📌 Implementation of Sparse Mixtures-of-Experts layer in PyTorch from Mistral Official Repo
📌 Util method to extract a specific token's representation from the last hidden states of a transformer model.
📌 Convert PyTorch model's parameters and tensors to half-precision floating-point format
📌 Quantizing 🤗 Transformers models with the GPTQ method
📌 Quantize Mixtral-8x7B so it can run in 24GB GPU
📌 What is GGML or GGUF in the world of Large Language Models ?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

LLM (Large Language Models) FineTuning Projects and notes on common practical techniques

Find me here..

Fine-tuning LLM (and YouTube Video Explanations)

Fine-tuning LLM

LLM Techniques and utils - Explained

Other Smaller Language Models

About

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
Finetune_llama_2_GPTQ		Finetune_llama_2_GPTQ
Mixtral_Chatbot_with_Gradio		Mixtral_Chatbot_with_Gradio
Quantize_with_HF_transformers		Quantize_with_HF_transformers
assets		assets
gemma-2b_ORPO_FineTuning_full_precision		gemma-2b_ORPO_FineTuning_full_precision
.gitignore		.gitignore
4-bit_LLM_Quantization_with_GPTQ.ipynb		4-bit_LLM_Quantization_with_GPTQ.ipynb
Add-task_specific_custom_layer_to_model.ipynb		Add-task_specific_custom_layer_to_model.ipynb
BERT-B~1.IPY		BERT-B~1.IPY
BERT_HuggingFace_Basic_Usages.ipynb		BERT_HuggingFace_Basic_Usages.ipynb
Cerebras-gpt-13B.ipynb		Cerebras-gpt-13B.ipynb
CodeLLaMA_34B_Conversation_with_Streamlit.py		CodeLLaMA_34B_Conversation_with_Streamlit.py
Convert_Pytorch_model_to_half_precision.ipynb		Convert_Pytorch_model_to_half_precision.ipynb
Cosine_Similarity_between_sentences_with_Transformers.ipynb		Cosine_Similarity_between_sentences_with_Transformers.ipynb
DPOTrainer.ipynb		DPOTrainer.ipynb
DeBERTa_Fine_Tuning-for_Amazon_Review_Dataset_Pytorch.ipynb		DeBERTa_Fine_Tuning-for_Amazon_Review_Dataset_Pytorch.ipynb
Evaluate_token_per_sec.ipynb		Evaluate_token_per_sec.ipynb
Fake_News_Classification_with_LSTM_Tensorflow.ipynb		Fake_News_Classification_with_LSTM_Tensorflow.ipynb
Falcon-7B_FineTuning_with_PEFT_and_QLORA.ipynb		Falcon-7B_FineTuning_with_PEFT_and_QLORA.ipynb
FinBERT_Long_Text.ipynb		FinBERT_Long_Text.ipynb
FinBERT_Long_Text_Part_2.ipynb		FinBERT_Long_Text_Part_2.ipynb
FineTuning_phi-1_5_with_PRFT_LoRA.ipynb		FineTuning_phi-1_5_with_PRFT_LoRA.ipynb
Fine_Tuning_DistilBert_Poem_Sentiments.ipynb		Fine_Tuning_DistilBert_Poem_Sentiments.ipynb
Fine_Tuning_Pegasus_for_Text_Summarization.ipynb		Fine_Tuning_Pegasus_for_Text_Summarization.ipynb
Finetune_codellama-34B-with-QLoRA.ipynb		Finetune_codellama-34B-with-QLoRA.ipynb
Finetune_opt_bnb_peft.ipynb		Finetune_opt_bnb_peft.ipynb
Fuzzy-String-Matching.ipynb		Fuzzy-String-Matching.ipynb
GGUF_GGML_GPTQ-basics.md		GGUF_GGML_GPTQ-basics.md
Inference_Yarn-Llama-2-13b-128k_Github.ipynb		Inference_Yarn-Llama-2-13b-128k_Github.ipynb
Jamba_Finetuning_Colab-Pro.ipynb		Jamba_Finetuning_Colab-Pro.ipynb
LlaMa-2-FineTuning.ipynb		LlaMa-2-FineTuning.ipynb
Llama-3_Finetuning_on_custom_dataset_with_unsloth.ipynb		Llama-3_Finetuning_on_custom_dataset_with_unsloth.ipynb
Llama_3_Finetuning_ORPO_with_Unsloth.ipynb		Llama_3_Finetuning_ORPO_with_Unsloth.ipynb
Local-Inferencing_LlaMa-2.ipynb		Local-Inferencing_LlaMa-2.ipynb
Mistral-7B-Inferencing.ipynb		Mistral-7B-Inferencing.ipynb
Mistral_7B_Instruct_GPTQ_finetune.ipynb		Mistral_7B_Instruct_GPTQ_finetune.ipynb
Mistral_7b_FineTuning_with_DPO_Direct_Preference_Optimization.ipynb		Mistral_7b_FineTuning_with_DPO_Direct_Preference_Optimization.ipynb
Mistral_FineTuning_with_PEFT_and_QLORA.ipynb		Mistral_FineTuning_with_PEFT_and_QLORA.ipynb
MoE_implementation_Mistral_official_Repo.ipynb		MoE_implementation_Mistral_official_Repo.ipynb
Multi-class-text-classifica_fine-tuning-distilbert.ipynb		Multi-class-text-classifica_fine-tuning-distilbert.ipynb
NAMED_~1.IPY		NAMED_~1.IPY
Nous-Hermes-2-Yi-34B-GGUF_in_Kaggle_free_GPU_with_llama_cpp.ipynb		Nous-Hermes-2-Yi-34B-GGUF_in_Kaggle_free_GPU_with_llama_cpp.ipynb
Quantize_mixtral-instruct-awq_in_so_it_can_run_in_24GB.ipynb		Quantize_mixtral-instruct-awq_in_so_it_can_run_in_24GB.ipynb
Quantizing_Transformers_with_GPTQ.ipynb		Quantizing_Transformers_with_GPTQ.ipynb
README.md		README.md
RoPE-As-Implemented-in-LlaMa-Source-Code.ipynb		RoPE-As-Implemented-in-LlaMa-Source-Code.ipynb
Select_last_meaningful_token_from_each_sequence.ipynb		Select_last_meaningful_token_from_each_sequence.ipynb
Text_Analytics_of_Tweet_Emotion_EDA_with_Plotly.ipynb		Text_Analytics_of_Tweet_Emotion_EDA_with_Plotly.ipynb
Text_Summarization_BART_T5_Pegasus.ipynb		Text_Summarization_BART_T5_Pegasus.ipynb
TinyLlama_with_Unsloth_and_RoPE_Scaling_dolly-15k.ipynb		TinyLlama_with_Unsloth_and_RoPE_Scaling_dolly-15k.ipynb
TogetherAI_API_with_LangChain.ipynb		TogetherAI_API_with_LangChain.ipynb
Topic_Modeling_with_LDA.ipynb		Topic_Modeling_with_LDA.ipynb
Traverse_through_sub-modules_of_PyTorch_Model.ipynb		Traverse_through_sub-modules_of_PyTorch_Model.ipynb
Understanding_rank_r_in_LoRA_and_related_Matrix_Math.ipynb		Understanding_rank_r_in_LoRA_and_related_Matrix_Math.ipynb
Validation_log_perplexity.md		Validation_log_perplexity.md
Web_scraping_with_Large_Language_Models_LLM_AnthropicAI_LangChainAI.ipynb		Web_scraping_with_Large_Language_Models_LLM_AnthropicAI_LangChainAI.ipynb
Word-Vectors-Understanding-with-Spacy.ipynb		Word-Vectors-Understanding-with-Spacy.ipynb
YT_Fine_tuning_BERT_NER_v1.ipynb		YT_Fine_tuning_BERT_NER_v1.ipynb
Zero_Shot_Learning_multilingual-NER.ipynb		Zero_Shot_Learning_multilingual-NER.ipynb
apply_chat_template.ipynb		apply_chat_template.ipynb
enable_json_mode.ipynb		enable_json_mode.ipynb
layered_inference_with_airllm_70B_LLM_Inference_on_a_Single_4GB_GPU.ipynb		layered_inference_with_airllm_70B_LLM_Inference_on_a_Single_4GB_GPU.ipynb
sentiment_analysis_textblob_Vader.ipynb		sentiment_analysis_textblob_Vader.ipynb
targets_layers_in_peft_and_meaning_of_Rank_of_a_Matrix.ipynb		targets_layers_in_peft_and_meaning_of_Rank_of_a_Matrix.ipynb
tinyllama_fine-tuning_Taylor_Swift.ipynb		tinyllama_fine-tuning_Taylor_Swift.ipynb
togetherai-api-with_Mixtral.ipynb		togetherai-api-with_Mixtral.ipynb

shuaicui0607/llm

Folders and files

Latest commit

History

Repository files navigation

LLM (Large Language Models) FineTuning Projects and notes on common practical techniques

Find me here..

Fine-tuning LLM (and YouTube Video Explanations)

Fine-tuning LLM

LLM Techniques and utils - Explained

Other Smaller Language Models

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages