Skip to content

Commit

Permalink
Some changes including new title.
Browse files Browse the repository at this point in the history
  • Loading branch information
rafalab committed Apr 22, 2024
1 parent 1b585a0 commit 72c7ba1
Show file tree
Hide file tree
Showing 55 changed files with 604 additions and 534 deletions.
4 changes: 2 additions & 2 deletions _quarto.yml
Original file line number Diff line number Diff line change
Expand Up @@ -6,12 +6,12 @@ execute:
cache: true

book:
title: Advanced Data Science
title: Introduction to Data Science
subtitle: Statistics and Prediction Algorithms Through Case Studies
reader-mode: true
page-footer:
left: |
Advanced Data Science was written by Rafael A. Irizarry
Introduction to Data Science was written by Rafael A. Irizarry
right: |
This book was built with <a href="https://quarto.org/">Quarto</a>.
site-url: http://rafalab.dfci.harvard.edu/dsbook-part-2
Expand Down
8 changes: 4 additions & 4 deletions docs/highdim/dimension-reduction.html
Original file line number Diff line number Diff line change
Expand Up @@ -5,7 +5,7 @@
<meta name="generator" content="quarto-1.3.353">
<meta name="viewport" content="width=device-width, initial-scale=1.0, user-scalable=yes">
<meta name="author" content="Rafael A. Irizarry">
<title>Advanced Data Science - 22&nbsp; Dimension reduction</title>
<title>Introduction to Data Science - 22&nbsp; Dimension reduction</title>
<style>
code{white-space: pre-wrap;}
span.smallcaps{font-variant: small-caps;}
Expand Down Expand Up @@ -109,7 +109,7 @@
<!-- sidebar -->
<nav id="quarto-sidebar" class="sidebar collapse collapse-horizontal sidebar-navigation floating overflow-auto"><div class="pt-lg-2 mt-2 text-left sidebar-header">
<div class="sidebar-title mb-0 py-0">
<a href="../">Advanced Data Science</a>
<a href="../">Introduction to Data Science</a>
<div class="sidebar-tools-main">
<a href="https://github.com/rafalab/dsbook-part-2" title="Source Code" class="quarto-navigation-tool px-1" aria-label="Source Code"><i class="bi bi-github"></i></a>
<a href="" class="quarto-reader-toggle quarto-navigation-tool px-1" onclick="window.quartoToggleReader(); return false;" title="Toggle reader mode">
Expand Down Expand Up @@ -748,7 +748,7 @@ <h1 class="title">
<span><span class="fu"><a href="https://rdrr.io/r/graphics/hist.html">hist</a></span><span class="op">(</span><span class="va">z</span><span class="op">[</span>,<span class="fl">1</span><span class="op">]</span>, breaks <span class="op">=</span> <span class="fu"><a href="https://rdrr.io/r/base/seq.html">seq</a></span><span class="op">(</span><span class="op">-</span><span class="fl">4</span>,<span class="fl">4</span>,<span class="fl">0.5</span><span class="op">)</span><span class="op">)</span></span></code><button title="Copy to Clipboard" class="code-copy-button"><i class="bi"></i></button></pre></div>
</div>
<p>We can visualize these to see how the first component summarizes the data. In the plot below, red represents high values and blue negative values:</p>
<div class="cell" data-layout-align="center" height="5" data-hash="dimension-reduction_cache/html/illustrate-pca-twin-heights_f45b9d5d17ca2b561c3de6d50b0cddfd">
<div class="cell" data-layout-align="center" height="5" data-hash="dimension-reduction_cache/html/illustrate-pca-twin-heights_f66700fe465dd08f100048303c90bfdd">
<div class="cell-output-display">
<div class="quarto-figure quarto-figure-center">
<figure class="figure"><p><img src="dimension-reduction_files/figure-html/illustrate-pca-twin-heights-1.png" class="img-fluid figure-img" style="width:70.0%"></p>
Expand Down Expand Up @@ -1213,7 +1213,7 @@ <h1 class="title">
</nav>
</div> <!-- /content -->
<footer class="footer"><div class="nav-footer">
<div class="nav-footer-left">Advanced Data Science was written by Rafael A. Irizarry</div>
<div class="nav-footer-left">Introduction to Data Science was written by Rafael A. Irizarry</div>
<div class="nav-footer-center">
&nbsp;
</div>
Expand Down
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
6 changes: 3 additions & 3 deletions docs/highdim/intro-highdim.html
Original file line number Diff line number Diff line change
Expand Up @@ -8,7 +8,7 @@

<meta name="author" content="Rafael A. Irizarry">

<title>Advanced Data Science - High dimensional data</title>
<title>Introduction to Data Science - High dimensional data</title>
<style>
code{white-space: pre-wrap;}
span.smallcaps{font-variant: small-caps;}
Expand Down Expand Up @@ -91,7 +91,7 @@
<nav id="quarto-sidebar" class="sidebar collapse collapse-horizontal sidebar-navigation floating overflow-auto">
<div class="pt-lg-2 mt-2 text-left sidebar-header">
<div class="sidebar-title mb-0 py-0">
<a href="../">Advanced Data Science</a>
<a href="../">Introduction to Data Science</a>
<div class="sidebar-tools-main">
<a href="https://github.com/rafalab/dsbook-part-2" title="Source Code" class="quarto-navigation-tool px-1" aria-label="Source Code"><i class="bi bi-github"></i></a>
<a href="" class="quarto-reader-toggle quarto-navigation-tool px-1" onclick="window.quartoToggleReader(); return false;" title="Toggle reader mode">
Expand Down Expand Up @@ -662,7 +662,7 @@ <h1 class="title">High dimensional data</h1>
</div> <!-- /content -->
<footer class="footer">
<div class="nav-footer">
<div class="nav-footer-left">Advanced Data Science was written by Rafael A. Irizarry</div>
<div class="nav-footer-left">Introduction to Data Science was written by Rafael A. Irizarry</div>
<div class="nav-footer-center">
&nbsp;
<div class="toc-actions"><div><i class="bi bi-github"></i></div><div class="action-links"><p><a href="https://github.com/rafalab/dsbook-part-2/blob/main/highdim/intro-highdim.qmd" class="toc-action">View source</a></p><p><a href="https://github.com/rafalab/dsbook-part-2/issues/new" class="toc-action">Report an issue</a></p></div></div></div>
Expand Down
6 changes: 3 additions & 3 deletions docs/highdim/linear-algebra.html
Original file line number Diff line number Diff line change
Expand Up @@ -5,7 +5,7 @@
<meta name="generator" content="quarto-1.3.353">
<meta name="viewport" content="width=device-width, initial-scale=1.0, user-scalable=yes">
<meta name="author" content="Rafael A. Irizarry">
<title>Advanced Data Science - 21&nbsp; Applied Linear Algebra</title>
<title>Introduction to Data Science - 21&nbsp; Applied Linear Algebra</title>
<style>
code{white-space: pre-wrap;}
span.smallcaps{font-variant: small-caps;}
Expand Down Expand Up @@ -109,7 +109,7 @@
<!-- sidebar -->
<nav id="quarto-sidebar" class="sidebar collapse collapse-horizontal sidebar-navigation floating overflow-auto"><div class="pt-lg-2 mt-2 text-left sidebar-header">
<div class="sidebar-title mb-0 py-0">
<a href="../">Advanced Data Science</a>
<a href="../">Introduction to Data Science</a>
<div class="sidebar-tools-main">
<a href="https://github.com/rafalab/dsbook-part-2" title="Source Code" class="quarto-navigation-tool px-1" aria-label="Source Code"><i class="bi bi-github"></i></a>
<a href="" class="quarto-reader-toggle quarto-navigation-tool px-1" onclick="window.quartoToggleReader(); return false;" title="Toggle reader mode">
Expand Down Expand Up @@ -958,7 +958,7 @@ <h1 class="title"><span id="sec-matrix-algebra" class="quarto-section-identifier
</nav>
</div> <!-- /content -->
<footer class="footer"><div class="nav-footer">
<div class="nav-footer-left">Advanced Data Science was written by Rafael A. Irizarry</div>
<div class="nav-footer-left">Introduction to Data Science was written by Rafael A. Irizarry</div>
<div class="nav-footer-center">
&nbsp;
</div>
Expand Down
6 changes: 3 additions & 3 deletions docs/highdim/matrices-in-R.html
Original file line number Diff line number Diff line change
Expand Up @@ -5,7 +5,7 @@
<meta name="generator" content="quarto-1.3.353">
<meta name="viewport" content="width=device-width, initial-scale=1.0, user-scalable=yes">
<meta name="author" content="Rafael A. Irizarry">
<title>Advanced Data Science - 20&nbsp; Matrices in R</title>
<title>Introduction to Data Science - 20&nbsp; Matrices in R</title>
<style>
code{white-space: pre-wrap;}
span.smallcaps{font-variant: small-caps;}
Expand Down Expand Up @@ -109,7 +109,7 @@
<!-- sidebar -->
<nav id="quarto-sidebar" class="sidebar collapse collapse-horizontal sidebar-navigation floating overflow-auto"><div class="pt-lg-2 mt-2 text-left sidebar-header">
<div class="sidebar-title mb-0 py-0">
<a href="../">Advanced Data Science</a>
<a href="../">Introduction to Data Science</a>
<div class="sidebar-tools-main">
<a href="https://github.com/rafalab/dsbook-part-2" title="Source Code" class="quarto-navigation-tool px-1" aria-label="Source Code"><i class="bi bi-github"></i></a>
<a href="" class="quarto-reader-toggle quarto-navigation-tool px-1" onclick="window.quartoToggleReader(); return false;" title="Toggle reader mode">
Expand Down Expand Up @@ -1166,7 +1166,7 @@ <h1 class="title">
</nav>
</div> <!-- /content -->
<footer class="footer"><div class="nav-footer">
<div class="nav-footer-left">Advanced Data Science was written by Rafael A. Irizarry</div>
<div class="nav-footer-left">Introduction to Data Science was written by Rafael A. Irizarry</div>
<div class="nav-footer-center">
&nbsp;
</div>
Expand Down
8 changes: 4 additions & 4 deletions docs/highdim/matrix-factorization.html
Original file line number Diff line number Diff line change
Expand Up @@ -5,7 +5,7 @@
<meta name="generator" content="quarto-1.3.353">
<meta name="viewport" content="width=device-width, initial-scale=1.0, user-scalable=yes">
<meta name="author" content="Rafael A. Irizarry">
<title>Advanced Data Science - 24&nbsp; Matrix Factorization</title>
<title>Introduction to Data Science - 24&nbsp; Matrix Factorization</title>
<style>
code{white-space: pre-wrap;}
span.smallcaps{font-variant: small-caps;}
Expand Down Expand Up @@ -109,7 +109,7 @@
<!-- sidebar -->
<nav id="quarto-sidebar" class="sidebar collapse collapse-horizontal sidebar-navigation floating overflow-auto"><div class="pt-lg-2 mt-2 text-left sidebar-header">
<div class="sidebar-title mb-0 py-0">
<a href="../">Advanced Data Science</a>
<a href="../">Introduction to Data Science</a>
<div class="sidebar-tools-main">
<a href="https://github.com/rafalab/dsbook-part-2" title="Source Code" class="quarto-navigation-tool px-1" aria-label="Source Code"><i class="bi bi-github"></i></a>
<a href="" class="quarto-reader-toggle quarto-navigation-tool px-1" onclick="window.quartoToggleReader(); return false;" title="Toggle reader mode">
Expand Down Expand Up @@ -652,7 +652,7 @@ <h1 class="title">
<div class="cell" data-layout-align="center" data-hash="matrix-factorization_cache/html/unnamed-chunk-15_51dcda555208594fd5d18b0725d9acbe">
<div class="sourceCode" id="cb17"><pre class="downlit sourceCode r code-with-copy"><code class="sourceCode R"><span><span class="kw"><a href="https://rdrr.io/r/base/library.html">library</a></span><span class="op">(</span><span class="va"><a href="http://factominer.free.fr/missMDA/index.html">missMDA</a></span><span class="op">)</span></span>
<span><span class="va">ind</span> <span class="op">&lt;-</span> <span class="fu"><a href="https://rdrr.io/r/base/colSums.html">colSums</a></span><span class="op">(</span><span class="op">!</span><span class="fu"><a href="https://rdrr.io/r/base/NA.html">is.na</a></span><span class="op">(</span><span class="va">y</span><span class="op">)</span><span class="op">)</span> <span class="op">&gt;=</span> <span class="fl">25</span> <span class="op">|</span> <span class="fu"><a href="https://rdrr.io/r/base/colnames.html">colnames</a></span><span class="op">(</span><span class="va">y</span><span class="op">)</span> <span class="op">==</span> <span class="st">"3252"</span></span>
<span><span class="va">imputed</span> <span class="op">&lt;-</span> <span class="fu"><a href="https://rdrr.io/pkg/missMDA/man/imputePCA.html">imputePCA</a></span><span class="op">(</span><span class="va">r</span><span class="op">[</span>,<span class="va">ind</span><span class="op">]</span>, ncp <span class="op">=</span> <span class="fl">2</span>, coeff.ridge <span class="op">=</span> <span class="va">lambda</span><span class="op">)</span></span></code><button title="Copy to Clipboard" class="code-copy-button"><i class="bi"></i></button></pre></div>
<span><span class="va">imputed</span> <span class="op">&lt;-</span> <span class="fu">imputePCA</span><span class="op">(</span><span class="va">r</span><span class="op">[</span>,<span class="va">ind</span><span class="op">]</span>, ncp <span class="op">=</span> <span class="fl">2</span>, coeff.ridge <span class="op">=</span> <span class="va">lambda</span><span class="op">)</span></span></code><button title="Copy to Clipboard" class="code-copy-button"><i class="bi"></i></button></pre></div>
</div>
<p>To see how much we improve our previous prediction, we construct a matrix with the ratings in the test set:</p>
<div class="cell" data-layout-align="center" data-hash="matrix-factorization_cache/html/unnamed-chunk-16_859c6077f27749e51e39ea9980cac6e1">
Expand Down Expand Up @@ -1120,7 +1120,7 @@ <h1 class="title">
</nav>
</div> <!-- /content -->
<footer class="footer"><div class="nav-footer">
<div class="nav-footer-left">Advanced Data Science was written by Rafael A. Irizarry</div>
<div class="nav-footer-left">Introduction to Data Science was written by Rafael A. Irizarry</div>
<div class="nav-footer-center">
&nbsp;
</div>
Expand Down
6 changes: 3 additions & 3 deletions docs/highdim/regularization.html
Original file line number Diff line number Diff line change
Expand Up @@ -5,7 +5,7 @@
<meta name="generator" content="quarto-1.3.353">
<meta name="viewport" content="width=device-width, initial-scale=1.0, user-scalable=yes">
<meta name="author" content="Rafael A. Irizarry">
<title>Advanced Data Science - 23&nbsp; Regularization</title>
<title>Introduction to Data Science - 23&nbsp; Regularization</title>
<style>
code{white-space: pre-wrap;}
span.smallcaps{font-variant: small-caps;}
Expand Down Expand Up @@ -110,7 +110,7 @@
<!-- sidebar -->
<nav id="quarto-sidebar" class="sidebar collapse collapse-horizontal sidebar-navigation floating overflow-auto"><div class="pt-lg-2 mt-2 text-left sidebar-header">
<div class="sidebar-title mb-0 py-0">
<a href="../">Advanced Data Science</a>
<a href="../">Introduction to Data Science</a>
<div class="sidebar-tools-main">
<a href="https://github.com/rafalab/dsbook-part-2" title="Source Code" class="quarto-navigation-tool px-1" aria-label="Source Code"><i class="bi bi-github"></i></a>
<a href="" class="quarto-reader-toggle quarto-navigation-tool px-1" onclick="window.quartoToggleReader(); return false;" title="Toggle reader mode">
Expand Down Expand Up @@ -1161,7 +1161,7 @@ <h1 class="title">
</nav>
</div> <!-- /content -->
<footer class="footer"><div class="nav-footer">
<div class="nav-footer-left">Advanced Data Science was written by Rafael A. Irizarry</div>
<div class="nav-footer-left">Introduction to Data Science was written by Rafael A. Irizarry</div>
<div class="nav-footer-center">
&nbsp;
</div>
Expand Down
Loading

0 comments on commit 72c7ba1

Please sign in to comment.