π Hi there! I'm Pramod Toraskar, a highly skilled Principal Data Engineer at Red Hat, with over 14 years of experience in IT. I specialize in Data Engineering, AI, and Machine Learning, driving digital transformations through cutting-edge cloud technologies and data pipelines. My technical expertise and leadership have consistently delivered innovative and scalable solutions across multiple industries.
I have built my career working across various data-centric roles, with a focus on:
- Data Engineering: Designing robust, scalable ETL pipelines, implementing data architectures, and automating workflows to streamline business processes.
- Cloud & Big Data Solutions: Extensive experience working with AWS (S3, EC2, RDS), Snowflake, Starburst, and various data warehousing and cloud platforms.
- AI & Machine Learning: Proven expertise in integrating ML models into production environments and creating AI-driven solutions for data insights and process optimization.
- Cross-Functional Collaboration: Working closely with cross-functional teams, including Marketing, Sales, and IT, to drive data strategy and deliver business value.
- Programming: Python, SQL, JavaScript, Shell scripting
- Cloud Platforms: AWS (S3, EC2, RDS), GitLab, GitLab Innersource
- Data Tools: Snowflake, Fivetran, dbt, Apache Airflow (Astrocloud), Starburst
- Marketing & Sales Tools: Marketo, Salesforce, HG Insights, Datorama, Bombora, Eloqua, Outreach
- Development & Collaboration: CI/CD, DevOps, Jira, Confluence, Slack
- AI & Machine Learning: Working with large language models (LLMs), developing machine learning models, and integrating them into data products.
- Tech Stack: Fivetran, dbt, Snowflake
- Description: Built a data pipeline to sync marketing data from Marketo into Snowflake using Fivetran and transformed it with dbt for optimized analytics and reporting.
- Tech Stack: GitLab, Git, CI/CD
- Description: Led the migration of Git repositories and CI/CD pipelines from legacy platforms to GitLab CEE, ensuring a smooth transition and resolving blockers related to CI/CD configurations.
- Tech Stack: AWS, Adobe Audience Manager, Starburst
- Description: Developed an ETL pipeline to pull data from Starburst tables, format it, and push to Adobe Audience Manager S3 buckets, eliminating dependency on legacy systems.
- Tech Stack: Snowflake, dbt, Marketo, Salesforce, HG Insights
- Description: Designed and developed source-line and aggregated data products, integrating multiple third-party data sources into Snowflake for advanced analytics.
- Repository: Fluvii GitHub Repo
- Description: Contributed to the development and maintenance of Fluvii, a powerful data management solution used for streamlining data operations.
- Repository: qlikreader GitHub Repo
- Description: Built a tool to read and process data from Qlik dashboards, simplifying data extraction for business intelligence purposes.
- Repository: python-outreach GitHub Repo
- Description: Developed a Python-based solution to interact with the Outreach API for automating CRM tasks and managing sales data efficiently.
- Repository: dwm GitHub Repo
- Description: Customized and extended the dynamic window manager (dwm) for personal productivity and optimized workflow in a Unix environment.
- Rotating Facilitator - Sprint 7, Q4 2024: Demonstrated leadership in sprint facilitation to drive effective communication and goal achievement across teams.
- CI/CD Optimizations: Improved CI/CD pipelines for GitLab and various marketing-related projects, ensuring efficient deployment processes and reduced system downtimes.
- Data Governance Initiatives: Worked on significant governance projects related to personal identifiable information (PII) migration to non-production environments, ensuring compliance and data integrity.
I am committed to continuous innovation and expanding my expertise in:
- Advanced AI and Quantum Computing: Exploring the intersection of AI with quantum technologies to drive the future of data science.
- Data Science and Machine Learning: Deepening my knowledge in machine learning applications for predictive analytics and automation.
- Leadership & Patents: Aiming to lead groundbreaking projects in data-driven digital transformation and obtain patents for innovative solutions in AI and data engineering.
- Email: pramodtoraskar@ptoraska-mac
- LinkedIn: linkedin.com/in/pramodtoraskar
- GitHub: github.com/pramodtoraskar
Thank you for visiting my GitHub! Feel free to reach out for potential collaborations, mentorship, or just to talk about data and AI!