-
Notifications
You must be signed in to change notification settings - Fork 0
/
index.html
125 lines (94 loc) · 5.51 KB
/
index.html
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
<!DOCTYPE html>
<html lang="en">
<head>
<meta charset="utf-8">
<title>Another Book on Data Science</title>
<meta name="description" content="data science, R, Python, programming, machine learning">
<meta name="author" content="">
<!-- Le HTML5 shim, for IE6-8 support of HTML elements -->
<!--[if lt IE 9]>
<script src="https://html5shim.googlecode.com/svn/trunk/html5.js"></script>
<![endif]-->
<!-- Le styles -->
<link rel="stylesheet" href="bootstrap-1.1.0.min.css">
<link rel="stylesheet" href="style.css">
<link rel="stylesheet" href="small-screens.css">
<style>
h2 { padding-top: 0px; }
</style>
<script>
(function(i,s,o,g,r,a,m){i['GoogleAnalyticsObject']=r;i[r]=i[r]||function(){
(i[r].q=i[r].q||[]).push(arguments)},i[r].l=1*new Date();a=s.createElement(o),
m=s.getElementsByTagName(o)[0];a.async=1;a.src=g;m.parentNode.insertBefore(a,m)
})(window,document,'script','//www.google-analytics.com/analytics.js','ga');
ga('create', 'UA-142297640-1', 'anotherbookondatascience.com');
ga('send', 'pageview');
</script>
</head>
<body>
<div class="container-fluid">
<div class="hero-unit">
<h1>Another Book on Data Science</h1>
<p>Learn R and Python in Parallel</p>
<span class="author">Nailong Zhang</span>
</div>
<!-- Example row of columns -->
<div class="row">
<div class="span8">
<h2>Why am I writing this book?</h2>
<p>
Maybe a major reason is an existential crisis.
The feedback from readers is another important reason. I submitted a git repo with three Chapters of this book in PDF format to Hacker News, and surprisingly the repo got 500 stars in a week. I received a few emails expressing thanks and interests in more Chapters.
</p>
<h2>About</h2>
<p>There has been considerable debate over choosing R vs. Python for Data Science. Based on my limited knowledge/experience, both R and Python are great languages and are worth learning; so why not learn them together?
</p>
<p>
Besides the side-by-side comparison of the two popular languages used in Data Science, this book also focuses on the translation from mathematical models to codes. In the book, the audience could find the applications/implementations of some important algorithms from scratch, such as maximum likelihood estimation, inversion sampling, copula simulation, simulated annealing, bootstrapping, linear regression (lasso/ridge regression), logistic regression, gradient boosting trees, etc.
</p>
<p>
The code can be found at this git <a href="https://github.com/rnorm/book_sample">repo</a>. If you have any idea to share or find any errors of the book, please contact me directly via email [email protected].
</p>
<p>Interested in a physical copy? It's available from <a target="_blank" rel="noopener noreferrer" href="https://www.amazon.com/Tour-Data-Science-Chapman-Hall/dp/0367895862">Amazon</a> or <a target="_blank" rel="noopener noreferrer" href="https://www.routledge.com/A-Tour-of-Data-Science-Learn-R-and-Python-in-Parallel/Zhang/p/book/9780367895860">Routledge</a>.
</p>
</div>
<div class="span6">
<h2>Chapters</h2>
<dl>
<dt><a href="chapter1.html">Introduction to R/Python Programming</a></dt>
<dd>calculator, variable & type, functions, control flows, some built-in data structures, object-oriented programming</dd>
<dt><a href="chapter2.html">More on R/Python Programming</a></dt>
<dd>write & run R/Python scripts, debugging, benchmarking, vectorization, embarrassingly parallelism, evaluation strategy, speed up with C/C++, functional programming</dd>
<dt><a href="chapter3.html">data.table and pandas</a></dt>
<dd>SQL, introduction to data.table and pandas, indexing & selecting data, add/remove/update, group by, join</dd>
<dt><a href="chapter4.html">Random Variables & Distributions</a></dt>
<dd>sampling, distribution fitting, joint distribution/copula simulation, confidence interval, hypothesis testing</dd>
<dt><a href="chapter5.html">Linear Regression</a></dt>
<dd>basics of linear regression, linear hypothesis testing, ridge regression</dd>
<dt><a href="chapter6.html">Optimization in Practice</a></dt>
<dd>convexity, gradient descent, root-finding, general purpose minimization tools, linear programming, simulated annealing</dd>
<dt><a href="chapter7.html">Machine Learning – A gentle introduction</a></dt>
<dd>learning paradigm, universal approximation, overfitting, gradient boosting machine, reinforcement learning, computational differentiation</dd>
</dl>
</div>
</div>
</div> <!-- /container-fluid -->
</body>
</html>
</div> <!-- /container-fluid -->
<!-- Footer -->
<div id="footer">
<div class="inner">
<div class="container-fluid">
<div class="row">
<div class="span16">
<p style="text-align:center">
© 2022 Anotherbookondatascience.com
</p>
</div>
</div>
</div>
</div>
</div>
</body>
</html>