From 528b35a23c02e8200659bcf219f0ae336241bc64 Mon Sep 17 00:00:00 2001
From: Nick Byrne
Date: Wed, 19 Aug 2015 13:40:53 +1000
Subject: [PATCH 1/3] initialising nick-byrne-transcript.md

---
 transcripts/nick-byrne-transcript.md | 6 ++++++
 1 file changed, 6 insertions(+)
 create mode 100644 transcripts/nick-byrne-transcript.md

diff --git a/transcripts/nick-byrne-transcript.md b/transcripts/nick-byrne-transcript.md
new file mode 100644
index 00000000..0efdb20c
--- /dev/null
+++ b/transcripts/nick-byrne-transcript.md
@@ -0,0 +1,6 @@
+

Nick Byrne Transcript

+**Open Source Data Science Masters**
+
+This is here to facilitate an initial pull request
+
+Want to collaborate? Get in touch: [twitter](http://www.twitter.com/byrnenick)) or [email](mailto:nick@thinkactlive.com)

From 9e6c2eb945e2ad224f7147abd86da8a782ff8c3a Mon Sep 17 00:00:00 2001
From: Nick Byrne
Date: Wed, 19 Aug 2015 14:24:52 +1000
Subject: [PATCH 2/3] Commiting my initial planned OSDSM curriculum

---
 transcripts/nick-byrne-transcript.md | 79 +++++++++++++++++++++++++++-
 1 file changed, 77 insertions(+), 2 deletions(-)

diff --git a/transcripts/nick-byrne-transcript.md b/transcripts/nick-byrne-transcript.md
index 0efdb20c..d7101257 100644
--- a/transcripts/nick-byrne-transcript.md
+++ b/transcripts/nick-byrne-transcript.md
@@ -1,6 +1,81 @@

Nick Byrne Transcript

 **Open Source Data Science Masters**

-This is here to facilitate an initial pull request
+Want to collaborate? Get in touch:
+ * [twitter](http://www.twitter.com/byrnenick)); or
+ * [email](mailto:nick@thinkactlive.com)

-Want to collaborate? Get in touch: [twitter](http://www.twitter.com/byrnenick)) or [email](mailto:nick@thinkactlive.com)
+**OpenSource Data Science Masters Curriculum**
+Below is a planned curriculum that I'm looking to follow. As with life, I'm not expecting it to be followed linearly necessarily. And I may swap courses in and out as interesting things arise.
+I do plan to take at least one element from all of the recommended themes published in the OpenSource Data Science masters. And I'm favouring online courses as it's obviously easier to stay honest with regards to progress over reading a book and claiming that you know the subject matter.
+
+

Recognised openSource curriculum

+

Base Introduction

+Data Science Introductions
+ - [ ] Intro to Data Science by UW / Coursera, online course
+ - [ ] Data Science by Harvard, online course
+ - [ ] Data Science with Open Source Tools, book
+ - [ ] Introduction to Computer Science and Programming, by MIT OpenCourseWare
+*Intro to CS was listed in Python(Learning) section but felt it would be a good one to bring up front (despite having a good grasp of python)
+
+Mathematics
+ - [ ] Linear Programming (Math 407) University of Washington
+ - [ ] Statistics by Princeton & Coursera
+ - [ ] Differential Equations in Data Science, Python tutorial
+ - [ ] Problem-Solving Heuristics "How to Solve It" by Polya, Book
+
+

Computing

+Algorithms
+ - [ ] Algorithms Design & Analysis, by Stanford and Coursera
+
+Distributed Computing Paradigms
+ - [ ] Intro to Hadoop and MapReduce by Cloudera and Udacity
+*Note: I might swap the above course with an EdX course on Apache Spark and distributed computing*
+
+Databases
+ - [ ] Introduction to Databases, by Stanford
+
+Data Mining
+ - [ ] Mining Massive Data Sets, by Stanford and Coursera
+
+Machine Learning - Foundational & Theoretical
+ - [ ] Machine Learning, by Ng Stanford and Coursera (**in-progress**)
+ - [ ] The Elements of Statistical Learning, by Stanford
+
+Machine Learning - Practical
+ - [ ] Programming Collective Intelligence
+ - [ ] Intro to scikit-learn, by SciPy2013
+
+Probabilistic Modeling
+ - [ ] Probabilistic Graphical Models, by Stanford and Coursera
+
+Deep Learning (Neural Networks)
+ - [ ] Neural Networks, by Univesity of Toronto and Coursera
+
+Natural Language Processing
+ - [ ] From Languages to Information, by Stanford
+ - [ ] NLP with Python (NLKT library)
+
+Analysis
+ - [ ] Big Data Analysis with Twitter, by UC Berkeley
+
+

Data Design

+*To be confirmed*
+
+
+

Relevant prior studies

+ - [X] Adelaide University, Mathematics 1011
+ - [X] Adelaide University, Statistics 1001
+ - [X] Adelaide University, Engineering Modelling and Analysis 1003
+ - [X] Adelaide University, Mathematics 1012
+ - [X] Adelaide University, Differential Equations and Statistical Methods 2010
+ - [X] Adelaide University, Engineering Modelling and Analysis 2010
+ - [X] Adelaide University, Engineering Modelling and Analysis 3009
+ - [X] Adelaide University, Environmental Modelling, Management and Design 4987
+
+
+**OpenSource Data Science Masters Capstone Project**
+I would like to do a capstone project focused on using big data to understand workplace dynamics, and more appropriate hiring decisions. E.g. can we use big data to better understand an employees cultural fit?
+As I progress through the curriculum, I'll better define the capstone project.
+
+If you'd like to pair up for the capstone, [let me know](http://www.twitter.com/byrnenick))

From 13fdb6f2297b3abebbafc26d0983c5cf77bbadde Mon Sep 17 00:00:00 2001
From: Nick Byrne
Date: Wed, 19 Aug 2015 14:27:07 +1000
Subject: [PATCH 3/3] Update nick-byrne-transcript.md Simple formatting edits

---
 transcripts/nick-byrne-transcript.md | 9 +++++----
 1 file changed, 5 insertions(+), 4 deletions(-)

diff --git a/transcripts/nick-byrne-transcript.md b/transcripts/nick-byrne-transcript.md
index d7101257..84433940 100644
--- a/transcripts/nick-byrne-transcript.md
+++ b/transcripts/nick-byrne-transcript.md
@@ -1,11 +1,12 @@

Nick Byrne Transcript

-**Open Source Data Science Masters**
+**Open Source Data Science Masters**
+I'm currently looking for people to pair with, and work on a capstone project

 Want to collaborate? Get in touch:
- * [twitter](http://www.twitter.com/byrnenick)); or
+ * [twitter](http://www.twitter.com/byrnenick); or
  * [email](mailto:nick@thinkactlive.com)

-**OpenSource Data Science Masters Curriculum**
+**OpenSource Data Science Masters Curriculum**
 Below is a planned curriculum that I'm looking to follow. As with life, I'm not expecting it to be followed linearly necessarily. And I may swap courses in and out as interesting things arise.
 I do plan to take at least one element from all of the recommended themes published in the OpenSource Data Science masters. And I'm favouring online courses as it's obviously easier to stay honest with regards to progress over reading a book and claiming that you know the subject matter.

@@ -78,4 +79,4 @@ Analysis
 I would like to do a capstone project focused on using big data to understand workplace dynamics, and more appropriate hiring decisions. E.g. can we use big data to better understand an employees cultural fit?
 As I progress through the curriculum, I'll better define the capstone project.

-If you'd like to pair up for the capstone, [let me know](http://www.twitter.com/byrnenick))
+If you'd like to pair up for the capstone, [let me know](http://www.twitter.com/byrnenick)