Welcome to my journey through Platform Engineering, Site Reliability Engineering (SRE), and DevOps. This repository documents my learning path, projects, and notes on essential topics, tools, and practices in the field. Whether you're a beginner or a seasoned professional, it aims to offer insights into infrastructure, automation, and reliability in cloud-native environments.

This section outlines the study and learning resources, categorized by time, topics, and skills.

- 10 Days of Terraform: A deep dive into Infrastructure as Code (IaC) using Terraform, focusing on automating cloud infrastructure deployment.
- 30 Days of Go: Learning Go programming for performance, simplicity, and scalability in cloud-based and distributed systems.
- 30 Days of Python: Python-focused tutorials and projects for scripting, automation, and development in cloud environments.
- GitOps: Using Git as the single source of truth for CI/CD pipelines, infrastructure automation, and environment management.

- Distributed Systems: Concepts and practical knowledge to design, build, and maintain distributed systems that are scalable, fault-tolerant, and maintainable.
- Kubernetes: Container orchestration with Kubernetes, exploring concepts like clusters, pods, services, and ingress management.
- Infrastructure as Code (IaC): Hands-on tutorials and notes on managing infrastructure through code, using tools like Terraform, CloudFormation, and Pulumi.
- Observability: Techniques for monitoring and improving system performance with logs, metrics, and traces, including how to set up observability stacks.
- System Performance: Understanding and improving performance in cloud infrastructure, distributed systems, and services.

- System Administration: Notes on Linux/Unix system administration, focusing on performance tuning, automation, and system security.
- Kubernetes Ingress: Managing traffic routing in Kubernetes clusters and ensuring services are exposed and accessible.
- Java: Leveraging Java for building scalable backend services in microservices and cloud-native environments.
- Azure: Exploring cloud solutions using Microsoft Azure, including services, tools, and DevOps integration.
- Reading: A curated list of recommended readings, blogs, papers, and books related to SRE, DevOps, and cloud computing.
- Troubleshooting: A comprehensive guide to troubleshooting infrastructure, applications, and system performance issues.

- Algorithm Study Guide: Focused resources and exercises to master algorithms crucial for technical interviews and system optimization.
- Troubleshooting Interview Guide: Preparation materials and best practices for tackling troubleshooting interviews in SRE and DevOps roles.
- Data Structures: A study guide on data structures important for system design, performance optimization, and coding interviews.

Feel free to contribute by adding topics, sharing resources, or suggesting improvements. This journey is a continuous learning process, and all contributions are welcome.