Sweet, let's now improve the load/response time by adding a cache & shifting static content to a CDN.

# Cache
A cache is temporary storage that holds frequently accessed data or the results of expensive computations.

In our web application, every time a web page is loaded, expensive queries are sent to the database.
We can mitigate this using a cache.

## Cache tier
The cache tier is a temporary storage layer from which results are fetched much more rapidly than from the database.
It can also be scaled independently from the database.
![cache-tier](images/cache-tier.png)

The example above is a read-through cache - the server first checks if the data is available in the cache. If not, it is fetched from the database and stored in the cache for subsequent reads.
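
A minimal sketch of that read pattern, assuming a Redis instance behind the redis-py client - the `cache-tier` host, key format and TTL value are illustrative, not prescribed by the book:
```python
import json
import redis  # assumes a reachable cache server and the redis-py client

cache = redis.Redis(host="cache-tier", port=6379)
CACHE_TTL = 300  # expiration policy: 5 minutes, an illustrative value

def get_user(user_id: int) -> dict:
    key = f"user:{user_id}"
    cached = cache.get(key)
    if cached is not None:                           # cache hit: the database is not touched
        return json.loads(cached)
    user = query_database(user_id)                   # cache miss: fall back to the database
    cache.setex(key, CACHE_TTL, json.dumps(user))    # store the result for subsequent reads
    return user

def query_database(user_id: int) -> dict:
    # Placeholder for the expensive database query.
    return {"id": user_id, "name": "example"}
```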

## Considerations for using cache
* When to use it - usually useful when data is read frequently but modified infrequently. Caches usually don't preserve data upon restart so it's not a good persistence layer.
* Expiration policy - controls whether (and when) cached data expires and is removed from the cache. Make it too short - the DB will be queried too frequently. Make it too long - data will become stale.
* Consistency - How in sync should the data store & cache be? Inconsistency happens if data is changed in DB, but cache is not updated.
* Mitigating failures - A single cache server could be a single point of failure (SPOF). Consider over-provisioning it with a lot of memory and/or provisioning servers in multiple locations.
* Eviction policy - What happens when you want to add items to a cache, but it's full? The cache eviction policy controls that. Common policies - LRU, LFU, FIFO (a minimal LRU sketch follows).
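
For illustration, here's how LRU eviction could look, using Python's `OrderedDict` to track recency. Real caches (eg Redis, Memcached) implement this internally, so this is only a sketch of the idea:
```python
from collections import OrderedDict

class LRUCache:
    """Evicts the least recently used entry once capacity is reached."""

    def __init__(self, capacity: int):
        self.capacity = capacity
        self._data = OrderedDict()

    def get(self, key):
        if key not in self._data:
            return None
        self._data.move_to_end(key)         # mark as most recently used
        return self._data[key]

    def put(self, key, value):
        if key in self._data:
            self._data.move_to_end(key)
        self._data[key] = value
        if len(self._data) > self.capacity:
            self._data.popitem(last=False)  # evict the least recently used entry

cache = LRUCache(capacity=2)
cache.put("a", 1)
cache.put("b", 2)
cache.get("a")     # "a" becomes most recently used
cache.put("c", 3)  # evicts "b", the least recently used key
```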

# Content Delivery Network (CDN)
CDN == network of geographically dispersed servers, used for delivering static content - eg images, HTML, CSS, JS files.

Whenever a user requests some static content, the CDN server closest to the user serves it:
![cdn](images/cdn.png)

Here's the request flow:
![cdn-request-flow](images/cdn-request-flow.png)
* User tries fetching an image via URL. URLs are provided by the CDN, eg `https://mysite.cloudfront.net/logo.jpg`
* If the image is not in the cache, the CDN requests the file from the origin - eg web server, S3 bucket, etc.
* Origin returns the image to the CDN with an optional TTL (time to live) parameter, which controls how long the resource is cached (a minimal origin sketch follows this list).
* Subsequent users fetch the image from the CDN without any requests reaching the origin as long as it's within the TTL.
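
Here's a sketch of an origin handler setting that TTL via the standard `Cache-Control` header, using only Python's standard library. The file name, port and 24-hour value are illustrative assumptions:
```python
from http.server import BaseHTTPRequestHandler, HTTPServer

class OriginHandler(BaseHTTPRequestHandler):
    def do_GET(self):
        # Serve a static asset and tell the CDN (and browsers) to cache it for one day.
        with open("logo.jpg", "rb") as f:  # assumes logo.jpg exists next to this script
            body = f.read()
        self.send_response(200)
        self.send_header("Content-Type", "image/jpeg")
        self.send_header("Cache-Control", "max-age=86400")  # TTL of 24 hours
        self.send_header("Content-Length", str(len(body)))
        self.end_headers()
        self.wfile.write(body)

if __name__ == "__main__":
    HTTPServer(("0.0.0.0", 8080), OriginHandler).serve_forever()
```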

## Considerations for using a CDN
* Cost - CDNs are managed by third parties, to which you pay a fee. Be careful not to store infrequently accessed data there.
* Cache expiry - consider appropriate cache expiry. Too short - frequent requests to origin. Too long - data becomes stale.
* CDN fallback - clients should be able to work around the CDN (eg by requesting resources directly from the origin) if there is a temporary outage on the provider's end.
* Invalidation - can be done via an API call to the CDN vendor or by versioning objects (eg appending a version parameter to the URL).

Refined design of our web application:
![web-app-design-after-cdn](images/web-app-design-after-cdn.png)

# Stateless web tier
In order to scale our web tier, we need to make it stateless.

To do that, we can move user session data out of the web servers and into persistent storage, such as a relational database or a NoSQL store.

## Stateful architecture
Stateful servers remember client data across different requests. Stateless servers don't.
![stateful-servers](images/stateful-servers.png)

In the above case, users are coupled to the server which stores their session data. If they make a request to another server, it won't have access to the user's session.

This can be solved via sticky sessions, which most load balancers support, but it adds overhead.
Adding/removing servers becomes much more challenging, and so does handling server failures.

## Stateless architecture
![stateless-architecture](images/stateless-architecture.png)

In this scenario, servers don't store any user data themselves.
Instead, they store it in a shared data store, which all servers have access to.

This way, HTTP requests from users can be served by any web server.
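
Here's a sketch of how session handling could look with a shared store - assuming a Redis instance and the redis-py client; the host name, key format and TTL are made-up illustrative values:
```python
import json
import uuid
from typing import Optional

import redis  # assumes the redis-py client and a shared, reachable Redis instance

store = redis.Redis(host="session-store", port=6379)
SESSION_TTL = 3600  # keep sessions for an hour (illustrative value)

def create_session(user_id: str) -> str:
    # Any web server can create the session; the data lives in the shared store.
    session_id = str(uuid.uuid4())
    store.setex(f"session:{session_id}", SESSION_TTL, json.dumps({"user_id": user_id}))
    return session_id

def load_session(session_id: str) -> Optional[dict]:
    # Any other web server can read it back on a later request.
    raw = store.get(f"session:{session_id}")
    return json.loads(raw) if raw else None
```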

Updated web application architecture:
![web-app-architecture-updated](images/web-app-architecture-updated.png)

The user session data store could either be a relational database or a NoSQL data store, which is easier to scale for this kind of data.
The next step in the app's evolution is supporting multiple data centers.

# Data centers
![data-centers](images/data-centers.png)

In the above example, clients are geo-routed to the nearest data center based on their IP address.

In the event of an outage, we route all traffic to the healthy data center:
![data-center-failover](images/data-center-failover.png)

To achieve this multi-datacenter setup, there are several issues we need to address:
* traffic redirection - tooling for correctly directing traffic to the right data center. GeoDNS can be used in this case (a simplified routing sketch follows this list).
* data synchronization - in case of failover, users from DC1 go to DC2. The challenge is making sure their data is also available there - a common strategy is replicating data across data centers.
* test and deployment - automated deployment & testing is crucial to keep deployments consistent across DCs.
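
Purely as an illustration of the routing decision (in practice this is handled by GeoDNS/the traffic manager, not application code), here's a sketch with made-up data center names:
```python
# Hypothetical mapping of regions to data centers; all names are illustrative.
DATA_CENTERS = {"us-east": "dc1.example.com", "us-west": "dc2.example.com"}

def route(user_region: str, healthy_regions: set) -> str:
    """Prefer the user's nearest data center; fail over to any healthy one."""
    if user_region in healthy_regions:
        return DATA_CENTERS[user_region]
    fallback = next(iter(healthy_regions))  # during an outage, send all traffic to a healthy DC
    return DATA_CENTERS[fallback]

print(route("us-east", healthy_regions={"us-east", "us-west"}))  # -> dc1.example.com
print(route("us-east", healthy_regions={"us-west"}))             # -> dc2.example.com (failover)
```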

To further scale the system, we need to decouple different system components so they can scale independently.

# Message queues
Message queues are durable components that enable asynchronous communication.
![message-queue](images/message-queue.png)

Basic architecture:
* Producers create messages.
* Consumers/Subscribers subscribe to new messages and consume them.

Message queues enable producers to be decoupled from consumers.
If a consumer is down, a producer can still publish a message and the consumer will receive it at a later point.

Example use-case in our application - photo processing (sketched in code below):
* Web servers publish "photo processing tasks" to a message queue
* A variable number of workers (can be scaled up or down) subscribe to the queue and process those tasks.
![photo-processing-queue](images/photo-processing-queue.png)
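
A minimal in-process sketch of this producer/worker pattern - Python's standard `queue` stands in for a real durable message broker here, purely for illustration:
```python
import queue
import threading

photo_tasks = queue.Queue()  # stand-in for a durable message queue/broker

def web_server_publish(photo_id: str):
    # Producer: the web server enqueues the task and returns immediately.
    photo_tasks.put({"photo_id": photo_id, "operation": "resize"})

def worker():
    # Consumer: workers pull tasks at their own pace and can be scaled independently.
    while True:
        task = photo_tasks.get()
        print(f"processing {task['photo_id']}")
        photo_tasks.task_done()

for _ in range(3):  # the number of workers can be scaled up or down
    threading.Thread(target=worker, daemon=True).start()

web_server_publish("photo-42")
photo_tasks.join()  # wait until all published tasks have been processed
```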

# Logging, metrics, automation
Once your web application grows beyond a given point, investing in monitoring tooling is critical.
* Logging - error logs can be emitted to a data store, which can later be read by service operators.
* Metrics - collecting various types of metrics helps us collect business insight & monitor the health of the system.
* Automation - investing in continuous integration (automated builds, tests and deployments) detects problems early and increases developer productivity.

Updated system design:
![sys-design-after-monitoring](images/sys-design-after-monitoring.png)

# Database scaling
There are two approaches to database scaling - vertical and horizontal.

## Vertical scaling
Also known as scaling up, it means adding more physical resources to your database nodes - CPU, RAM, HDD, etc.
In Amazon RDS, for example, you can get a database node with 24 TB of RAM.

This kind of database can handle lots of data - eg Stack Overflow in 2013 served 10 million monthly unique visitors with a single database node.

Vertical scaling has some drawbacks, though:
* There are hardware limits to the amount of resources you can add to a node.
* You still have a single point of failure.
* Overall cost is high - the price of powerful servers is high.

## Horizontal scaling
Instead of adding bigger servers, you can add more of them:
![vertical-vs-horizontal-scaling](images/vertical-vs-horizontal-scaling.png)

Sharding is a type of database horizontal scaling which separates large data sets into smaller ones.
Each shard shares the same schema, but the actual data is different.

One way to shard the database is by some key - data is distributed across all shards by applying the modulo operator to that key:
![database-sharding](images/database-sharding.png)
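
For example, with hash-mod routing (4 shards assumed purely for illustration):
```python
NUM_SHARDS = 4  # illustrative shard count

def shard_for(user_id: int) -> int:
    # The same user_id always maps to the same shard.
    return user_id % NUM_SHARDS

print(shard_for(27))  # -> 3
print(shard_for(28))  # -> 0
```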

Here's how the user data looks in this example:
![user-data-in-shards](images/user-data-in-shards.png)

The sharding key (aka partition key) is the most important factor to consider when using sharding.
In particular, the key should be chosen in a way that distributes the data as evenly as possible.

Although sharding is a useful technique, it introduces extra complexity into the system:
* Resharding data - you need to do it if a single shard grows too big. This can happen rather quickly if data is distributed unevenly. Consistent hashing (sketched after this list) helps avoid moving too much data around.
* Celebrity problem (aka hotspot) - one shard could be accessed much more frequently than others and can lead to server overload. We may have to resort to using separate shards for certain celebrities.
* Join and de-normalization - It is hard to perform join operations across shards. A common workaround is to de-normalize your tables to avoid making joins.
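
A bare-bones consistent hashing sketch - the virtual-node count and hash choice are illustrative, and production systems use battle-tested implementations rather than hand-rolled ones:
```python
import bisect
import hashlib

def ring_position(key: str) -> int:
    # Map a key to a stable position on the hash ring.
    return int(hashlib.md5(key.encode()).hexdigest(), 16)

class ConsistentHashRing:
    def __init__(self, shards, vnodes=100):
        self.vnodes = vnodes  # virtual nodes smooth out the distribution
        self._ring = []       # sorted list of (position, shard)
        for shard in shards:
            self.add_shard(shard)

    def add_shard(self, shard: str):
        for i in range(self.vnodes):
            bisect.insort(self._ring, (ring_position(f"{shard}#{i}"), shard))

    def get_shard(self, key: str) -> str:
        # Walk clockwise from the key's position to the first shard on the ring.
        idx = bisect.bisect_left(self._ring, (ring_position(key), "")) % len(self._ring)
        return self._ring[idx][1]

ring = ConsistentHashRing(["shard-0", "shard-1", "shard-2"])
print(ring.get_shard("user:12345"))
# Adding "shard-3" later only remaps roughly a quarter of the keys instead of nearly all of them.
```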

Here's how our application architecture looks after introducing sharding and a NoSQL database for some of the non-relational data:
![updated-system-design](images/updated-system-design.png)

# Millions of users and beyond
Scaling a system is iterative.

What we've learned so far can get us far, but we might need to apply even more sophisticated techniques to scale the application beyond millions of users.

The techniques we've seen so far offer a good foundation to build on.

Here's a summary:
* Keep web tier stateless
* Build redundancy at every layer
* Cache frequently accessed data
* Support multiple data centers
* Host static assets in CDNs
* Scale your data tier via sharding
* Split your big application into multiple services
* Monitor your system & use automation