Skip to content

Commit

Permalink
Update readme.md for various projects
Browse files Browse the repository at this point in the history
  • Loading branch information
dennyglee committed Apr 27, 2020
1 parent 8fa1803 commit b4450d9
Show file tree
Hide file tree
Showing 4 changed files with 29 additions and 13 deletions.
2 changes: 1 addition & 1 deletion 2020-04-22 | Machine Learning with scikit-learn/README.md
Original file line number Diff line number Diff line change
@@ -1,6 +1,6 @@
## Machine Learning with scikit-learn | Introduction to Data Analysis for Aspiring Data Scientists

2020-04-22 | [Watch the video]() | This folder contains the presentation and sample notebooks
2020-04-22 | [Watch the video](https://www.youtube.com/watch?v=g103iO-izoI) | This folder contains the presentation and sample notebooks

Join us for a four part learning series: *Introduction to Data Analysis for Aspiring Data Scientists*. This is the third of four online workshops for anyone and everyone interested in learning about data analysis. No previous programming experience required.

Expand Down
11 changes: 11 additions & 0 deletions 2020-04-29 | Intro to Apache Spark/README.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,11 @@
## Introduction to Apache Spark | Introduction to Data Analysis for Aspiring Data Scientists

2020-04-29 | [Watch the video](https://www.youtube.com/watch?v=9U4ED7KQwlE) | This folder contains the presentation and sample notebooks

Join us for Part 4 of our online learning series: *Introduction to Data Analysis for Aspiring Data Scientists*. This is the final online workshop in this series for anyone and everyone interested in learning about data analysis.

**Part 4: Introduction to Apache Spark**

Abstract: This workshop covers the fundamentals of Apache Spark, the most popular big data processing engine. In this workshop, you will learn how to ingest data with Spark, analyze the Spark UI, and gain a better understanding of distributed computing. We will be using data released by the [New York Times](https://github.com/nytimes/covid-19-data). No prior knowledge of Spark is required, but Python experience is highly recommended.

Who should attend this workshop: Anyone and everyone, CS students and even non-technical folks are welcome to join.
29 changes: 17 additions & 12 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -16,24 +16,19 @@ Sections

## Upcoming-Tech-Talks

#### <img src="./images/Machine_learning_Black-01.png" width="32"/> 2020-04-22 - [Workshop | Introduction to Data Analysis for Aspiring Data Scientists: Machine Learning with scikit-learn](https://www.meetup.com/data-ai-online/events/269838467/)
<blockquote>
scikit-learn is one of the most popular open-source machine learning libraries among data science practitioners. This workshop will walk through what machine learning is, the different types of machine learning, and how to build a simple machine learning model. This workshop focuses on the techniques of applying and evaluating machine learning methods, rather than the statistical concepts behind them. We will be using data released by the <a href="https://github.com/CSSEGISandData/COVID-19" target="_blank">Johns Hopkins Center for Systems Science and Engineering (CSSE) Novel Coronavirus (COVID-19)</a>. Prior basic Python experience is recommended.
</blockquote><br/>
<img src="./images/introduction-to-data-analysis-for-aspiring-data-scientists-part-3.jpg" width="800"/><br/>

#### <img src="https://pages.databricks.com/rs/094-YMS-629/images/delta-lake-tiny-logo.png"> 2020-04-23 [Predictive Maintenance (PdM) on IoT Data for Early Fault Detection w/ Delta Lake](https://www.meetup.com/data-ai-online/events/270166033/)
<blockquote>
Predictive Maintenance (PdM) is different from other routine or time-based maintenance approaches as it combines various sensor readings and sophisticated analytics on thousands of logged events in near real time and promises several fold improvements in cost savings because tasks are performed only when warranted. The collaborative Data and Analytics platform from Databricks is a great technology fit to facilitate these use cases by providing a single unified platform to ingest the sensor data, perform the necessary transformations and exploration, run ML and generate valuable insights.
</blockquote><br/>
<img src="./images/predictive-maintenance-iot.jpg" width="800"/><br/>

#### <img src="./images/Machine_learning_Black-01.png" width="32"/> 2020-04-29 - [Workshop | Introduction to Data Analysis for Aspiring Data Scientists: Introduction to Apache Spark](https://www.meetup.com/data-ai-online/events/270166620/)
<blockquote>
This workshop covers the fundamentals of Apache Spark, the most popular big data processing engine. In this workshop, you will learn how to ingest data with Spark, analyze the Spark UI, and gain a better understanding of distributed computing. We will be using data released by the <a href="https://github.com/CSSEGISandData/COVID-19" target="_blank">Johns Hopkins Center for Systems Science and Engineering (CSSE) Novel Coronavirus (COVID-19)</a>. Prior basic Python experience is recommended.
</blockquote><br/>
<img src="./images/introduction-to-data-analysis-for-aspiring-data-scientists-part-4.jpg" width="800"/><br/>


#### <img src="https://pages.databricks.com/rs/094-YMS-629/images/delta-lake-tiny-logo.png"> 2020-04-30 [Using Delta as a Change Data Capture Source](https://www.meetup.com/data-ai-online/events/270202602/)
<blockquote>
While it is common to use Delta Lake as a sink for change data captured from traditional data sources; customers are increasingly asking how to use Delta tables as a source for a change data capture (CDC) process. To state a different way, how can we read a stream of changes from a Delta table, so that they can be propagated downstream. In each of these cases, we want to capture a change stream from a Delta table and send it somewhere for further processing. In this session, we will discuss the architecture, use cases, and solutions.
</blockquote>
<img src="./images/using-delta-as-a-change-data-capture-source.jpeg" width="800"/><br/>

<a name="Featured"/>

## Featured
Expand Down Expand Up @@ -62,6 +57,16 @@ Sections

## Previous-Tech-Talks

#### <img src="https://pages.databricks.com/rs/094-YMS-629/images/delta-lake-tiny-logo.png"> 2020-04-23 [Predictive Maintenance (PdM) on IoT Data for Early Fault Detection w/ Delta Lake](https://www.meetup.com/data-ai-online/events/270166033/)
<blockquote>
Predictive Maintenance (PdM) is different from other routine or time-based maintenance approaches as it combines various sensor readings and sophisticated analytics on thousands of logged events in near real time and promises several fold improvements in cost savings because tasks are performed only when warranted. The collaborative Data and Analytics platform from Databricks is a great technology fit to facilitate these use cases by providing a single unified platform to ingest the sensor data, perform the necessary transformations and exploration, run ML and generate valuable insights.
</blockquote><br/>

#### <img src="./images/Machine_learning_Black-01.png" width="32"/> 2020-04-22 - [Workshop | Introduction to Data Analysis for Aspiring Data Scientists: Machine Learning with scikit-learn](https://www.meetup.com/data-ai-online/events/269838467/)
<blockquote>
scikit-learn is one of the most popular open-source machine learning libraries among data science practitioners. This workshop will walk through what machine learning is, the different types of machine learning, and how to build a simple machine learning model. This workshop focuses on the techniques of applying and evaluating machine learning methods, rather than the statistical concepts behind them. We will be using data released by the <a href="https://github.com/CSSEGISandData/COVID-19" target="_blank">Johns Hopkins Center for Systems Science and Engineering (CSSE) Novel Coronavirus (COVID-19)</a>. Prior basic Python experience is recommended.
</blockquote><br/>

#### <img src="https://pages.databricks.com/rs/094-YMS-629/images/delta-lake-tiny-logo.png"> 2020-04-16 - [Diving into Delta Lake: DML Internals](https://www.youtube.com/watch?v=7ewmcdrylsA)
<blockquote>
In the earlier Delta Lake Internals webinar series sessions, we described how the Delta Lake transaction log works. In this session, we will dive deeper into how commits, snapshot isolation, and partition and files change when performing deletes, updates, merges, and structured streaming.
Expand Down
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.

0 comments on commit b4450d9

Please sign in to comment.