Skip to content
View khalil1604's full-sized avatar

Block or report khalil1604

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
khalil1604/README.md

Khalil Bezrati πŸ‘‹

Data Scientist

Website

About Me πŸš€

I'm a Data Scientist specializing in building end-to-end Machine Learning systems and AI applications. My expertise spans across MLOps, Natural Language Processing, and developing production-ready ML solutions.

Tools and Technologies

Python, Scikit-learn, Tensorflow, langchain, FastAPI, Flask, Mlflow, Airflow, Docker, Github actions, MongoDB, Hugging face ecosystem, AWS(EC2, ECR), Azure(WebApp, ACR), ETL, Web scraping, fine-tuning, NLP, CV

πŸ”₯ Featured Projects

Machine Learning Engineering

  1. Gemstone Price Prediction

    • End-to-end ML regression system with Azure deployment
    • Features complete pipeline from data processing to production endpoints
  2. Network Security Phishing Prediction

    • Production-ready ML system for detecting phishing attempts
  3. Wine Quality Prediction

    • Complete ML pipeline with MLflow integration
    • Experiment tracking and model versioning using dagshub

AI Engineering

  1. Natural Language Data Warehouse Query System

    • Innovative system for querying data warehouses using natural language
    • Bridges the gap between business users and complex data structures
  2. Interactive PDF Chatbot

    • RAG-based conversational system for PDF document interaction
    • Implements advanced document understanding and response generation
  3. Content Summarization Engine

    • Automated text summarization from YouTube videos and websites
    • Utilizes Langchain for efficient content processing and synthesis

Data Engineering :

  1. ETL Pipeline with Airflow and AWS
    • Airflow for managing workflows and scheduling data pipeline
    • Stores extracted data in AWS PostgreSQL

🀝 Let's Connect

Feel free to reach out if you're interested in collaborating on projects or discussing data science and AI engineering. Visit my portfolio website to learn more about my work.


πŸ’‘ "Turning Data into Insights, and Insights into Action"

Pinned Loading

  1. RAG-Interactive-PDF-Conversational-Chatbot RAG-Interactive-PDF-Conversational-Chatbot Public

    An interactive question-answering system using any LLM of your choice, allowing users to upload PDF documents and ask questions contextually based on the document content.

    Jupyter Notebook 1

  2. Search-Engine-with-Langchain-Tools-and-Agents Search-Engine-with-Langchain-Tools-and-Agents Public

    An interactive chatbot application using LangChain and Streamlit that allows users to chat with an agent capable of searching the web, querying Arxiv, and Wikipedia.

    Jupyter Notebook

  3. khalil1604-Chatbot-Code-Assistant-using-Ollama khalil1604-Chatbot-Code-Assistant-using-Ollama Public

    This project implements a chatbot using a customized codellama 7b, quantized to 4bit precision, served through Ollama API, and presented with a Gradio interface.

    Python

  4. Langchain-Summarize-Text-from-YouTube-or-Website Langchain-Summarize-Text-from-YouTube-or-Website Public

    This Streamlit app allows to summarize content from a YouTube video or a website URL using LangChain and Groq's Gemma-7b-It model.

    Jupyter Notebook