-
Notifications
You must be signed in to change notification settings - Fork 0
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
- Loading branch information
Showing
5 changed files
with
184 additions
and
13 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
|
@@ -165,3 +165,4 @@ cython_debug/ | |
#.idea/ | ||
|
||
poetry.lock | ||
db/ |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,83 @@ | ||
# Interview Preparation Document for Kevin Vegda | ||
|
||
**Job Title:** Data Scientist at Depop | ||
|
||
**Interview Preparation:** | ||
|
||
## Key Interview Questions and Talking Points | ||
|
||
### 1. Professional Experience | ||
- **Question:** Can you describe your experience at Flagright as a Machine Learning Engineer intern and the impact you had? | ||
- **Talking Points:** | ||
- Discuss setting up the MLOps pipeline for fraud detection. | ||
- Highlight the integration of models for fraud detection using AWS services. | ||
- Mention the improvement in client feedback by approximately 10%. | ||
|
||
- **Question:** What were your main responsibilities and achievements at Sprinklr? | ||
- **Talking Points:** | ||
- Describe shipping features like the modal notification system and the automatic scheduler. | ||
- Emphasize fixing high-priority same-day bugs. | ||
- Mention the recognition as a top performer in the Engineering team for two quarters. | ||
- Highlight the increase in average user engagement by 7%. | ||
|
||
### 2. Technical Skills | ||
- **Question:** Can you elaborate on your proficiency with Python and how you've applied it in your previous roles? | ||
- **Talking Points:** | ||
- Discuss using Python for developing models and setting up MLOps pipelines. | ||
- Mention specific libraries and frameworks used, such as PyTorch and TensorFlow. | ||
|
||
- **Question:** How have you utilized AI frameworks and tools like PyTorch and TensorFlow in your projects? | ||
- **Talking Points:** | ||
- Provide examples from your projects like the Automatic Speech Recognition and Python Code Autocomplete. | ||
- Discuss specific techniques and algorithms used, such as CNNs, RNNs, and Transformer architectures. | ||
|
||
### 3. Machine Learning and Data Science Expertise | ||
- **Question:** What experience do you have with designing and conducting large-scale experiments? | ||
- **Talking Points:** | ||
- Mention your thesis on Abstractive Document Summarisation and the experiments conducted to achieve performance gains. | ||
- Discuss any A/B testing or hypothesis testing conducted during your internship or projects. | ||
|
||
- **Question:** How do you stay up-to-date with the latest research and advancements in machine learning? | ||
- **Talking Points:** | ||
- Highlight your continuous learning through courses, reading research papers, and participating in relevant communities or conferences. | ||
|
||
### 4. Collaborative and Interdisciplinary Work | ||
- **Question:** Can you share an example of a time you worked with cross-functional teams? | ||
- **Talking Points:** | ||
- Discuss your experience at Sprinklr working in an Agile environment and collaborating with different teams. | ||
- Highlight your interdisciplinary communication skills and any specific instances where these were crucial. | ||
|
||
### 5. Specific Job Role Related | ||
- **Question:** What attracts you to the Data Scientist role at Depop, and how do you see yourself contributing to our mission? | ||
- **Talking Points:** | ||
- Express your enthusiasm for Depop’s mission of promoting circular fashion and sustainability. | ||
- Relate your skills and experiences to the responsibilities listed in the job description, such as delivering machine learning solutions for search and discovery. | ||
|
||
- **Question:** How would you approach designing a machine learning solution to show users personalized results in real-time? | ||
- **Talking Points:** | ||
- Outline a high-level approach, including understanding user behavior, data preprocessing, model selection, and real-time inference. | ||
- Mention any relevant experience with similar problems, such as your work on fraud detection or user engagement features. | ||
|
||
### 6. Additional Skills and Bonus Points | ||
- **Question:** Do you have experience with search or recommendation models? | ||
- **Talking Points:** | ||
- Discuss any relevant coursework or projects, and how the skills gained can be applied to Depop’s needs. | ||
- Mention your continuous learning efforts to stay updated with these areas. | ||
|
||
- **Question:** Can you elaborate on your experience with cloud platforms like AWS? | ||
- **Talking Points:** | ||
- Provide details from your internship at Flagright, where you used AWS services like SageMaker, S3, API Gateway, and Docker. | ||
- Highlight any specific achievements or efficiencies gained through the use of AWS. | ||
|
||
### Personal and Soft Skills | ||
- **Question:** How do you ensure effective communication of technical findings to non-technical audiences? | ||
- **Talking Points:** | ||
- Discuss your experience in technical writing and documentation. | ||
- Provide an example of a complex technical concept you successfully communicated to a non-technical audience. | ||
|
||
- **Question:** What motivates you to work in the field of data science and machine learning? | ||
- **Talking Points:** | ||
- Share your passion for solving complex problems and staying updated with the latest advancements in the field. | ||
- Mention any specific areas of interest, such as NLP or AI, and how they align with the role at Depop. | ||
|
||
By preparing these questions and talking points, Kevin can effectively highlight his strengths and relevance to the Data Scientist role at Depop. This preparation will help him confidently navigate the interview process and demonstrate his suitability for the position. |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,69 @@ | ||
# Kevin Vegda | ||
+91-7984503748 • [email protected] • linkedin.com/in/kevin-vegda/ • kevinv.vercel.app | ||
|
||
## Summary | ||
Results-driven Data Scientist with a solid track record of delivering machine learning solutions to solve industry-scale problems. Proficient in Python and experienced with frameworks such as sci-kit-learn, TensorFlow, and PyTorch. Adept at setting up machine learning pipelines and conducting large-scale experiments. Passionate about staying updated with the latest research and applying new techniques for NLP and image processing. Collaborative team player with excellent communication skills, ready to contribute to Depop’s mission of building a diverse, progressive home of fashion. | ||
|
||
## Professional Experience | ||
|
||
### Flagright: Machine Learning Engineer (Intern) (04/2023 - 06/2023) | ||
- Set up an MLOps pipeline for fraud detection, including CI/CD, models, and monitoring dashboards using infrastructure-as-code in AWS CDK with CloudFormation stacks. | ||
- Integrated various models for fraud detection in financial data using serverless services on AWS such as SageMaker, S3, API Gateway, and Docker. | ||
- Increased positive client feedback on fraud detection by ~10%. | ||
- **Skills**: Machine Learning, MLOps, Cloud (AWS), Docker, Technical writing | ||
|
||
### Sprinklr: Product Engineer (11/2019 - 06/2021) | ||
- Shipped features including a modal notification system and an automatic scheduler to maximize user engagement. | ||
- Fixed high-priority same-day bugs affecting multi-million dollar clients. | ||
- Won top performer in the Engineering team for two quarters in 2020. | ||
- Increased average user engagement by 7%. | ||
- **Skills**: Large codebase management, regular code review, CI/CD practices, scalable and readable code, Agile environment, React.js, Design Systems, Interdisciplinary Communication | ||
|
||
## Technical Skills | ||
- **Programming Languages**: Python, R, JavaScript, React.js | ||
- **AI Frameworks and Tools**: PyTorch, PyTorch Lightning, LangChain, CrewAI | ||
- **Miscellaneous**: Docker, Jira, Git, Unix, AWS, Azure | ||
|
||
## Education and Certification | ||
|
||
### MSc Data Science and Analytics (Jun 2023) | ||
- **Institution**: Royal Holloway, University of London | ||
- **Distinction**: 8.63/10 | ||
- **Courses**: Natural Language Processing, Deep Learning, Large-scale Data Processing, Data-Centric AI | ||
- **Thesis**: Abstractive Document Summarisation with Pointer-Generator Networks | ||
- Achieved 2-3 ROUGE points of performance gains for abstractive document summarisation | ||
- Applied Attention mechanism to RNN-based Encoder-Decoders models | ||
- Added Pointer-Generator mechanism to Transformer-based models | ||
|
||
### BTech in Information and Communication Technology (Jan 2019) | ||
- **Institution**: Dhirubhai Ambani Institute of Information and Communication Technology (DAIICT) | ||
- **Courses**: Advanced Topics in Neural Networks, Theory of Computation, Graph Networks | ||
|
||
## Relevant Projects | ||
|
||
### Automatic Speech Recognition (May 2024) | ||
- Trained an ASR model using CNNs and RNNs in PyTorch with the Connectionist Temporal Classification (CTC) algorithm for alignment. | ||
- Used mel-spectrograms to convert the audio domain problem into a computer vision problem. | ||
|
||
### Python Code Autocomplete (June 2024) | ||
- Developed medium-sized Language Models (~1B params) for text generation on Python code. | ||
- Fine-tuned a DeepSeek-Coder-1.3B model with hyperparameter search using Weights and Biases Sweep. | ||
- Created another code completion model (~200M params) from scratch in PyTorch using a Decoder-only Transformer architecture. | ||
|
||
## Interests | ||
|
||
- Data Science, AI, and NLP | ||
- Collaborating on Data Science projects, especially those in NLP and focusing on LLMs | ||
- Actively looking for Data Science jobs | ||
|
||
## Communication Style | ||
|
||
- Excellent communication skills, essential for working in a collaborative and interdisciplinary environment. | ||
- Experienced in technical writing and documentation. | ||
- Ability to convey complex technical information in a user-friendly manner. | ||
|
||
## Achievements and Contributions | ||
|
||
- Top performer in the Engineering team at Sprinklr for two quarters in 2020. | ||
- Increased user engagement and client satisfaction through innovative features and efficient problem-solving. | ||
- Contributed to the academic field with a thesis on abstractive summarisation using state-of-the-art language models. |