
Commit

Merge pull request #2 from jygrinberg/patch-2
Fixing minor typos in introduction/index.html.
ermonste authored Jan 16, 2017
2 parents df2d288 + a48350f commit 06621f1
Showing 1 changed file with 2 additions and 2 deletions.
4 changes: 2 additions & 2 deletions docs/preliminaries/introduction/index.html
@@ -142,7 +142,7 @@ <h2 id="the-difficulties-of-probabilistic-modeling">The difficulties of probabilistic modeling</h2>

<h2 id="describing-probabilities-with-graphs">Describing probabilities with graphs</h2>

-<p>Our independence assumption can be conveniently represented in the form of a graph.<label for="nb1" class="margin-toggle"></label><input type="checkbox" id="nb1" class="margin-toggle" /><span class="marginnote"><img class="fullwidth" src="/cs228-notes/assets/img/naive-bayes.png" /><br />Graphical representation of the Naive Bayes spam classification model. We can interpret the directed graph as indicating a story of how the data was generated: first, we a spam/non-spam label was chosen at random; then a subset of <script type="math/tex">n</script> possible English words sampled independently and at random.</span>
+<p>Our independence assumption can be conveniently represented in the form of a graph.<label for="nb1" class="margin-toggle"></label><input type="checkbox" id="nb1" class="margin-toggle" /><span class="marginnote"><img class="fullwidth" src="/cs228-notes/assets/img/naive-bayes.png" /><br />Graphical representation of the Naive Bayes spam classification model. We can interpret the directed graph as indicating a story of how the data was generated: first, a spam/non-spam label was chosen at random; then a subset of <script type="math/tex">n</script> possible English words sampled independently and at random.</span>
This representation has the immediate advantage of being easy to understand. It can be interpreted as telling us a story: an email was generated by first choosing at random whether the email is spam or not (indicated by <script type="math/tex">y</script>), and then by sampling words one at a time. Conversely, if we have a story of how our dataset was generated, we can naturally express it as a graph with an associated probability distribution.</p>
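The generative story in this paragraph is easy to turn into a few lines of code. The sketch below is not taken from the notes or from this commit; the vocabulary and probabilities are made-up placeholders. It samples an email in exactly the order described: draw the label y first, then draw each word indicator independently given y.

```python
# Minimal sketch of the Naive Bayes generative story (illustrative values only).
import random

vocab = ["pill", "meeting", "free", "project"]   # hypothetical vocabulary
p_spam = 0.3                                     # assumed P(y = 1)
p_word_given_y = {                               # assumed P(x_i = 1 | y)
    1: {"pill": 0.6, "meeting": 0.1, "free": 0.7, "project": 0.05},
    0: {"pill": 0.02, "meeting": 0.5, "free": 0.1, "project": 0.6},
}

def sample_email():
    """Sample (y, x): choose the spam label first, then each word given y."""
    y = 1 if random.random() < p_spam else 0
    x = {w: int(random.random() < p_word_given_y[y][w]) for w in vocab}
    return y, x

print(sample_email())
```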

<p>More importantly, we want to submit various queries to the model (e.g. what is the probability of spam given that I see the word “pill”?); answering these questions will require specialized algorithms that will be most naturally defined using graph-theoretical concepts. We will also use graph theory to analyze the speed of learning algorithms and to quantify the computational complexity (e.g. NP-hardness) of different learning tasks.</p>
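As a concrete instance of the query mentioned above, the snippet below applies Bayes' rule to compute P(spam | the word "pill" appears) under a Naive Bayes model of this kind; the numbers are illustrative assumptions, not values from the notes.

```python
# Sketch of an inference query via Bayes' rule (made-up parameters).
p_spam = 0.3                 # assumed prior P(y = 1)
p_pill_given_spam = 0.6      # assumed P(x_pill = 1 | y = 1)
p_pill_given_ham = 0.02      # assumed P(x_pill = 1 | y = 0)

# P(y=1 | x_pill=1) = P(x_pill=1 | y=1) P(y=1) / P(x_pill=1)
evidence = p_pill_given_spam * p_spam + p_pill_given_ham * (1 - p_spam)
posterior = p_pill_given_spam * p_spam / evidence
print(f"P(spam | 'pill') = {posterior:.3f}")   # ~0.928 with these numbers
```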
@@ -182,7 +182,7 @@ <h3 id="inference">Inference</h3>

<h3 id="learning">Learning</h3>

-<p>Our last key task refers to fitting a model to a dataset, which could be for example a large number of labeled examples of spam. By looking at the data, we can infer useful patterns (e.g. which word are found more frequently in spam emails), which we can then use to make predictions about the future. However, we will see that learning and inference are also inherently linked in a more subtle way, since inference will turn out to be a key subroutine that we will repeatedly call within learning algorithms. Also, the topic of learning will feature important connections to the field of computational learning theory — which deals with questions such as generalization from limited data and overfitting — as well as to Bayesian statistics — which tells us (among other things) about how to combine prior knowledge and observed evidence in a principled way.</p>
+<p>Our last key task refers to fitting a model to a dataset, which could be for example a large number of labeled examples of spam. By looking at the data, we can infer useful patterns (e.g. which words are found more frequently in spam emails), which we can then use to make predictions about the future. However, we will see that learning and inference are also inherently linked in a more subtle way, since inference will turn out to be a key subroutine that we will repeatedly call within learning algorithms. Also, the topic of learning will feature important connections to the field of computational learning theory — which deals with questions such as generalization from limited data and overfitting — as well as to Bayesian statistics — which tells us (among other things) about how to combine prior knowledge and observed evidence in a principled way.</p>
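For intuition, here is a minimal sketch of this fitting step for the Naive Bayes spam model: estimate the class prior and per-word frequencies by counting over a toy, made-up labeled dataset (with Laplace smoothing). The notes themselves treat learning far more generally.

```python
# Sketch of learning Naive Bayes parameters by counting (toy data, assumptions only).
from collections import Counter

dataset = [  # (label, set of words present); 1 = spam, 0 = not spam
    (1, {"free", "pill"}),
    (1, {"pill"}),
    (0, {"meeting", "project"}),
    (0, {"project"}),
]
vocab = {"free", "pill", "meeting", "project"}

n_spam = sum(y for y, _ in dataset)
p_spam = n_spam / len(dataset)                 # MLE of P(y = 1)

word_counts = {1: Counter(), 0: Counter()}
for y, words in dataset:
    word_counts[y].update(words)

n_by_label = {1: n_spam, 0: len(dataset) - n_spam}
# Laplace-smoothed estimate of P(word present | y)
p_word_given_y = {
    y: {w: (word_counts[y][w] + 1) / (n_by_label[y] + 2) for w in vocab}
    for y in (0, 1)
}
print(p_spam, p_word_given_y[1]["pill"])
```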



