-
Notifications
You must be signed in to change notification settings - Fork 2
/
Copy pathtelescope.txt
328 lines (181 loc) · 208 KB
/
telescope.txt
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
<p>Wow, I didn’t expect the process of installing Telescope to be this hard, I didn’t expect that it would be difficult to just install the prerequisites, it was pretty rough. Installing Microsoft WSL2 was an “interesting” experience to say the least, it required installing Ubuntu and a Linux kernel update but that part was easy, the hard part came when I tried to go from WSL1 to WSL2. I saw that it wouldn’t allow me to update to WSL2 and I had to troubleshoot the issues, the first issue I found was that I was on an old update of windows, 1909 instead of 2004, which I had to update. Actually, not sure if the windows update was to get Docker to work, might have been both. The second issue I found was that virtualization was not turned enabled in my BIOS, so I had to turn that on. It was quite difficult to find that issue, I had virtualization turned on in my system settings so I had to figure out why it wasn’t working on my PC to enable WSL2, I got virtualization and WSL2 to work in the end though. </p>
<p>I also had some trouble with installing Redis and Elasticsearch, it wasn’t too difficult, however, I got it to run on Ubuntu but not in Visual Studio Code, this is the test for Elasticsearch on Ubuntu:</p>
<figure><img src="https://osdnathanp.files.wordpress.com/2020/11/image-7.png?w=498" loading="lazy"></figure>
<p>In the end, I had to edit some settings in .env of the Telescope folder, the MOCK_REDIS=1 and MOCK_ELASTIC=1 so I could test it and it deployed in the end. I will continue to try to fix it, I’m not sure what’s wrong though.</p>
<figure><img src="https://osdnathanp.files.wordpress.com/2020/11/image-2.png?w=1024" loading="lazy">localhost:3000/feeds</figure>
<figure><img src="https://osdnathanp.files.wordpress.com/2020/11/image-3.png?w=1024" loading="lazy"><a href="http://localhost:8000/">http://localhost:8000/</a></figure>
<p>I think where I could have went wrong was when I saw “In the root directory, run <code>cp env.example .env</code> on Linux/Mac or <code>copy env.example .env</code>.” I didn’t know where the root directory was for sure, I tried on Ubuntu but it said that “Permission denied” and I couldn’t figure out how to access it, I also thought that it might have been the wrong area but I couldn’t think of a different root directory.</p>
<p>Another issue could have been Docker however I thought I got that right, not 100% sure though. I think it’s correct but I’ve never used it before.</p>
<figure><img src="https://osdnathanp.files.wordpress.com/2020/11/image-4.png?w=875" loading="lazy"></figure>
<p>Coding the addition to my PYLinkChecker.py wasn’t hard, the hard part was the set-up. My original PYLinkChecker.py code was in good shape to add the feature, but I made sure to create a separate python file to test if the feature would work before I added it to the main file which made testing easier.</p>
<p>I hope that it’s not this hard to install and test software like this in the future but I know it’s probably just going to get harder, so I’ll keep this experience in mind and hope for easy installation in the future. I also created a <a href="https://gist.github.com/NathanPang001/f1612d1147a76f81715b8a31b9873d03">gist</a> for this project with a diff file and my python file. I would include some commands, my steps, etc., but this blog has gone on for too long already, if you made it this far, congratulations!</p>
<p>TLDR; Had trouble installing, technically got through it in the end.</p><p>The biggest takeaway from October is how I managed to get in and start contributing to Open Source(OS) projects. Telescope is one of the higher tier OS projects I finally got the chance to work with. Telescope is a web application that aggregates blog feeds and displays all of them under one roof. In this blog, I will talk about my experience setting up Telescope to work with my link checker project. </p>
<p>The highlight of my week with OSD600 is the setup of Telescope. If you ever had to set up your environment to run a project, the next few sentences will not sound new to you. I started by looking at the Telescope wiki to find myself reading the names of new technologies. On the frontend, the code is written in React using a framework called Gatsby, which in the backend one of the servers that are running is a data-caching server called Redis that loads the feeds quickly from memory. There are other services that needs to be configured before running Telescope locally which are Elasticsearch, Docker, Node.js. Once all these services and technologies configured locally only then we can start having them run altogether. As David mentioned in his videos, in order to run a project like Telescope all these services are running like a network (an IT ecosystem); and my challenge was exactly that. </p>
<p>Using a Windows operating system to run all these services is still vague for me. I wasn’t ready to install another virtual machine using Oracle, because frankly, it’s a lot of work and sometimes a waste of time. Enter WSL, Windows Subsystem for Linus, that lets developers run Linux environment directly on Windows without the use of traditional virtual machines or dual-boot setup. It came pre-packaged with the common command-line tools and utilities (such as cd, apt, git, etc) that I can use to set up my environment. WSL 2 is the better version hence the first thing I did was enable the virtual machine feature and set my Linux distribution to default. After that, I was able to install redis, elasticsearch, docker engine, docker-compose and node into my machine. My first issue was understanding the flow of the wiki. Like with any wiki, it takes a while to understand the style of writing especially when there are a number of steps involved. Somehow I got myself pulling all the containers I needed to run the project but was kept getting errors. </p>
<p>The first mistake I had was docker not running at startup. Since I have docker before, I configured it in a way that it won’t run at startup due to memory issues. So I had to run docker engine manually before running any services. I also had issues with containers that I set up earlier but didn’t run properly because of cgroup mount error, which was later fixed. The entire process of setting up my environment and running Telescope took an entire day. Fortunately, I was able to talk to one of the devs, including David, who helped me through the process of having Telescope network up and running. With that, I was able to document these troubleshooting steps with a <a href="https://github.com/Seneca-CDOT/telescope/pull/1293">PR </a>to Telescope. Setting up an environment is a real pain, most of which I have documented in my PR, but it’s quite satisfying once everything is working the way it should.</p>
<p>Fast-forwarding to this week’s lab, we were asked to download the html content of the first 10 posts from Telescope’s backend only to later parse all the links in the html page and check the http status code of those links. Working with JSON files was easy for me since I worked on something similar to <a href="https://www.abuzayed.ca/my-blog/osd600/2020/10/20/utilitybelt-console-api/">one of my Hacktoberfest PRs</a>. The challenge was to parse out all the http links from these blog posts. After I got the entire blog post downloaded in an html format in a string, I simply ran a <code>foreach</code> that </p>
<figure><img src="https://www.abuzayed.ca/my-blog/osd600/wp-content/uploads/2020/11/image-3.png" loading="lazy"></figure>
<p>would split the string every time a double-quotation(“) mark was found. From I looked for any string that started with “http” and added that into a list of strings which will later be checked for its individual http status code.</p>
<figure><img src="https://www.abuzayed.ca/my-blog/osd600/wp-content/uploads/2020/11/image-4-1024x321.png" loading="lazy"></figure>
<p>It was nice to see my program checking the http status codes of links that was not from a file but from an actual post on the web. </p>
<p>Thank you for taking the time to read my blog. Until next time, stay safe and stay healthy!</p><h3>Telescope</h3>
<figure><img src="https://muskanshinh.files.wordpress.com/2020/11/gettyimages-960773850-85f50747071140b7bdc10eaf65fd55d8.jpg?w=768" loading="lazy"></figure>
<p>In the first few weeks of this course, my professor had mentioned about <a href="https://github.com/Seneca-CDOT/telescope">Telescope</a>. The first time I heard the term, I thought same as you probably did that the project telescope = something related to it astronomy/space. To my surprise, Telescope is kinda a blog post but not exactly. It displays all the blogs related to Open Source so that all things are in one place. It is also beneficial in a way that one is informed about upcoming technologies and the ongoing open source projects. </p>
<p>So for lab 6 we basically had to change the <a href="https://github.com/Shinh18/urltester/tree/lab6">Urltester</a> tool to check for broken network requests for the post urls from the Telescope API. I started to work on this lab in mid-week. I read the <a href="https://github.com/Seneca-CDOT/telescope/blob/master/docs/environment-setup.md">environment setup</a> requirements for the project and I got scared because the technologies were so new to me. Honestly, it got so overwhelming that I closed the GitHub tab and started working on another assignment. Then the other day I said to myself that I have to do it, I can’t get out of this. So I sat down with my coffee and starting reading about Redis and Elasticsearch. I’ll be honest, even after reading a lot I couldn’t wrap my head around it. I decided I’m going to start with it and see how far I get. </p>
<p>How far I get? I couldn’t even get through the first step. Let me tell you why. So, the option was to either install Redis and Elasticsearch as native applications or use Docker. I had installed and worked with Docker just a little bit for one of the projects I worked on in Release 0.2. So, I thought it might be easy. After forking the project, cloning it to local device and installing the necessary node packages, I ran the command </p>
<p><code>sudo systemctl start docker</code></p>
<p>And it didn’t work giving an error that the command is not recognized. So I searched the error, read some things but in vain. I decided I am gong to download both of them as native applications and then see if it works. Just before working on installations, to my rescue came my friend and open source colleague Palak who suggested that do not run this command, instead run the second command directly i.e. </p>
<p><code>docker-compose up --build redis elasticsearch</code></p>
<p>I was so happy to see the logs suggesting that it worked. After that, doing other steps wasn’t a very difficult task. I didn’t have Postman available on my device so I directly followed the url <a href="http://localhost:3000/posts">http://localhost:3000/posts</a> where I could see the JSON as below</p>
<figure><img src="https://muskanshinh.files.wordpress.com/2020/11/screen-shot-2020-11-08-at-2.45.05-am.png?w=1024" loading="lazy"></figure>
<p>The next task was to modify my Urltester tool to check these links and give status back accordingly. First I created a separate js file [ readFile.js ] to read and process the file passed to it as argument along with a boolean arguments for JSON string and all/good/bad flags. The goal was to add the functionality without taking away from the existing features that were implemented before. Another file getPosts.js was created to fetch posts from the liveserver localhost:3000 that was running, receive data [ id, url ] in response. I added the urls received into a posts array. Then, I created a new file postsUrl.txt to store all the arrays as JSON strings. </p>
<p>Then my main file – index.js was modified to send appropriate arguments to readFile.js depending upon if a filename is passed with or without [ –json | –all | –bad | –good]</p>
<p>It was not difficult to tweak the tool to give appropriate output for good, bad or unknown post urls from the Telescope API. </p>
<figure><img src="https://muskanshinh.files.wordpress.com/2020/11/screen-shot-2020-11-08-at-2.14.02-am.png?w=825" loading="lazy"></figure>
<h3>Git Diff</h3>
<p>After changing the tool, I did </p>
<pre>$ git add .
$ git commit -m "..."
$ git diff
</pre>
<p>Git diff didn’t show me anything whereas it was supposed to show the changes made to the files. I then figured that I had misunderstood it’s purpose. It was supposed to be run before staging the changes and of course, committing the changes.</p>
<p>I looked for a way to undo last commit and found the solution on <a href="https://stackoverflow.com/questions/927358/how-do-i-undo-the-most-recent-local-commits-in-git">StackOverFlow</a>. </p>
<pre>$ git commit -m "Something terribly misguided" # (0: Your Accident)
$ git reset HEAD~ # (1)
<< edit files as necessary >> # (2)
$ git add . # (3)
$ git commit -c ORIG_HEAD # (4)
</pre>
<p>This solved my problem. I ran <code>git diff</code> and it gave me a log of all the changes made to the file which I copy-pasted to my <a href="https://gist.github.com/Shinh18/34973a38fdb9f23dca4867bf7f6365f6">gist</a> along with the new files creates [ readFile.js, getPosts.js, postsUrl.txt ]</p><figure><img src="https://cdn-images-1.medium.com/max/1024/1*iZfPSF2eonWQw86fjRCjxw.jpeg" loading="lazy"></figure><p>Telescope, UrlPilgrim and DIFF+GIST.</p><p>I have been working on Telescope since the begging of the course, so I already have a local version of it running on my machine. I had never seen our #telescope’s slack channel so busy since I arrived on the project; I’m happy to see many Telescope documentation improvements.</p><p>URL PILGRIM TELESCOPE MODULE:</p><p>I create an argument that makes UrlPilgrim(my URL checker) target Telescope and check for URLs inside the lastest posts; I will explain how this module works:</p><p>When you run $>urlpilgrim — telescope, first UrlPilgrim will read the JSON that Telescope provides with the latest ten posts. After this, UrlPilgrim will build the URL to each post and then will download the content of each post and store it in an object; then it will test the content of this object (text/HTML) against the URL regex expression to get the URLs then UrlPilgrim will start to check the URLs found.</p><p>The workflow is something like this:</p><p>UrlPilgrim -> TELESCOPE last 10 posts ids-> UrlPilgrim build URL for each post in a object {posts} -> UrlPilgrim download the content of the posts in another object {contentOfPosts} -> UrlPilgrim extracts the URLs from the content and create a Set (to clean up the dups) then an Array of URLs [urlsFromContent] ->UrlPilgrim test the urls inside [urlsFromContent].</p><p>After this, I created a <a href="https://gist.github.com/PedroFonsecaDEV/84e869355a90d44d2904b25bddfdc101">GIST</a> with the DIFF of the UrlPilgrim before and after the Telescope module. (Running the command $>git diff OLD_COMMIT_HASH NEW_COMMIT_HASH).</p><p>The more I wrote JS, the more I love JS. Some many reasons for that.</p><p>See you soon.</p><img src="https://medium.com/_/stat?event=post.clientViewed&referrerSource=full_rss&postId=4190de949b55" loading="lazy"><p><a href="https://res.cloudinary.com/practicaldev/image/fetch/s--6Spz4kY3--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://dev-to-uploads.s3.amazonaws.com/i/3rh1wng067hz91mqr8m7.png"><img src="https://res.cloudinary.com/practicaldev/image/fetch/s--6Spz4kY3--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://dev-to-uploads.s3.amazonaws.com/i/3rh1wng067hz91mqr8m7.png" loading="lazy"></a></p>
<h2>
<a href="#setup-environment">
</a>
Setup environment
</h2>
<p>I tried to set up the telescope environment on my Windows PC. I installed wsl2 and Ubuntu. I also followed the instruction<br>
of the document. Some commands like <code>systemclt start</code> <code>init.d</code> failed.</p>
<p>Then I gave my old MacBook air a try. Docker runs well.<br>
<a href="https://res.cloudinary.com/practicaldev/image/fetch/s--gWw-O9Ig--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://dev-to-uploads.s3.amazonaws.com/i/khqwvyxh5rlrc3w7yd4a.png"><img src="https://res.cloudinary.com/practicaldev/image/fetch/s--gWw-O9Ig--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://dev-to-uploads.s3.amazonaws.com/i/khqwvyxh5rlrc3w7yd4a.png" loading="lazy"></a></p>
<p>But when I did <code>npm install</code>, there were some warnings, and When I ran <br>
<code>npm start</code> there was an error.</p>
<p><a href="https://res.cloudinary.com/practicaldev/image/fetch/s--iIx82JFn--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://dev-to-uploads.s3.amazonaws.com/i/iw1bjgacjos1cw0umrsq.png"><img src="https://res.cloudinary.com/practicaldev/image/fetch/s--iIx82JFn--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://dev-to-uploads.s3.amazonaws.com/i/iw1bjgacjos1cw0umrsq.png" loading="lazy"></a></p>
<p>It was frustrating that I had to go back to wsl, and installed <code>Redis</code> and <code>Elasticsearch</code> as native applications. And I learned how to run a program as a background Damon.<br>
</p>
<div>
<pre class="hljs bash"><code>Redis-server
ctrl-z
<span class="hljs-built_in">bg</span>
</code></pre>
<div>
<div>
Enter fullscreen mode
Exit fullscreen mode
</div>
</div>
</div>
<h2>
<a href="#change-the-code">
</a>
Change the code.
</h2>
<p>My linkChecker can check urls from a file, and the logic of my program is processing the file content into a string. So I had to convert the body of each post to a string.<br>
I added one flag variable to check from the url <code>http:localhost:3000</code>, and extract all the posts to a string. Here is the change I have made.<br>
</p>
<div>
<pre class="hljs go"><code>diff --git a/helpers.<span class="hljs-keyword">go</span> b/helpers.<span class="hljs-keyword">go</span>
index <span class="hljs-number">74</span>ffce1.<span class="hljs-number">.6</span>c2a61e <span class="hljs-number">100644</span>
--- a/helpers.<span class="hljs-keyword">go</span>
+++ b/helpers.<span class="hljs-keyword">go</span>
@@ <span class="hljs-number">-17</span>,<span class="hljs-number">6</span> +<span class="hljs-number">17</span>,<span class="hljs-number">11</span> @@ <span class="hljs-keyword">import</span> (
<span class="hljs-string">"mvdan.cc/xurls/v2"</span>
)
+<span class="hljs-keyword">type</span> Post <span class="hljs-keyword">struct</span> {
+ ID <span class="hljs-keyword">string</span>
+ URL <span class="hljs-keyword">string</span>
+}
+
<span class="hljs-comment">// extract all urls from the string and return them as a slice (array)</span>
<span class="hljs-function"><span class="hljs-keyword">func</span> <span class="hljs-title">extractURL</span><span class="hljs-params">(str <span class="hljs-keyword">string</span>)</span> []<span class="hljs-title">string</span></span> {
rxStrict := xurls.Strict()
@@ <span class="hljs-number">-24</span>,<span class="hljs-number">6</span> +<span class="hljs-number">29</span>,<span class="hljs-number">37</span> @@ <span class="hljs-function"><span class="hljs-keyword">func</span> <span class="hljs-title">extractURL</span><span class="hljs-params">(str <span class="hljs-keyword">string</span>)</span> []<span class="hljs-title">string</span></span> {
<span class="hljs-keyword">return</span> foundUrls
}
+<span class="hljs-function"><span class="hljs-keyword">func</span> <span class="hljs-title">parseFromTelescope</span><span class="hljs-params">()</span> []<span class="hljs-title">string</span></span> {
+ <span class="hljs-keyword">var</span> allPostBody []<span class="hljs-keyword">byte</span>
+ resp, err := http.Get(<span class="hljs-string">"http://localhost:3000/posts"</span>)
+ <span class="hljs-keyword">if</span> err != <span class="hljs-literal">nil</span> {
+ <span class="hljs-built_in">panic</span>(err)
+ }
+ <span class="hljs-keyword">defer</span> resp.Body.Close()
+
+ body, err := ioutil.ReadAll(resp.Body)
+ <span class="hljs-keyword">if</span> err != <span class="hljs-literal">nil</span> {
+ <span class="hljs-built_in">panic</span>(err)
+ }
+ <span class="hljs-keyword">var</span> posts []Post
+ <span class="hljs-keyword">if</span> err := json.Unmarshal(body, &posts); err != <span class="hljs-literal">nil</span> {
+ <span class="hljs-built_in">panic</span>(err)
+ }
+ <span class="hljs-keyword">for</span> _, post := <span class="hljs-keyword">range</span> posts {
+ resp, err := http.Get(<span class="hljs-string">"http://localhost:3000"</span> + post.URL)
+ <span class="hljs-keyword">if</span> err != <span class="hljs-literal">nil</span> {
+ log.Fatal(err)
+ }
+ <span class="hljs-keyword">defer</span> resp.Body.Close()
+ body, err := ioutil.ReadAll(resp.Body)
+ <span class="hljs-keyword">if</span> err != <span class="hljs-literal">nil</span> {
+ log.Fatal(err)
+ }
+ allPostBody = <span class="hljs-built_in">append</span>(allPostBody, body...)
+ }
+ <span class="hljs-keyword">return</span> extractURL(<span class="hljs-keyword">string</span>(allPostBody))
+}
+
<span class="hljs-comment">//Function to parse ignore URLs from provided ignore file path</span>
<span class="hljs-function"><span class="hljs-keyword">func</span> <span class="hljs-title">parseIgnoreURL</span><span class="hljs-params">(ignoreFilePath <span class="hljs-keyword">string</span>)</span> []<span class="hljs-title">string</span></span> {
<span class="hljs-keyword">var</span> ignoreURLs []<span class="hljs-keyword">string</span>
diff --git a/urlChecker.<span class="hljs-keyword">go</span> b/urlChecker.<span class="hljs-keyword">go</span>
index <span class="hljs-number">9</span>c97457.<span class="hljs-number">.41</span>ab8c4 <span class="hljs-number">100644</span>
--- a/urlChecker.<span class="hljs-keyword">go</span>
+++ b/urlChecker.<span class="hljs-keyword">go</span>
@@ <span class="hljs-number">-25</span>,<span class="hljs-number">6</span> +<span class="hljs-number">25</span>,<span class="hljs-number">8</span> @@ <span class="hljs-function"><span class="hljs-keyword">func</span> <span class="hljs-title">main</span><span class="hljs-params">()</span></span> {
<span class="hljs-comment">// ignore url flag</span>
ignoreFlag := flag.BoolP(<span class="hljs-string">"ignore"</span>, <span class="hljs-string">"i"</span>, <span class="hljs-literal">false</span>, <span class="hljs-string">"ignore url patterns"</span>)
+ telescopeFlag := flag.BoolP(<span class="hljs-string">"telescope"</span>, <span class="hljs-string">"t"</span>, <span class="hljs-literal">false</span>, <span class="hljs-string">"check from telescope"</span>)
+
flag.Parse()
<span class="hljs-comment">//deal with non-file path, giving usage message</span>
<span class="hljs-keyword">if</span> <span class="hljs-built_in">len</span>(os.Args) == <span class="hljs-number">1</span> {
@@ <span class="hljs-number">-113</span>,<span class="hljs-number">11</span> +<span class="hljs-number">115</span>,<span class="hljs-number">27</span> @@ <span class="hljs-function"><span class="hljs-keyword">func</span> <span class="hljs-title">main</span><span class="hljs-params">()</span></span> {
<span class="hljs-keyword">return</span>
}
+ <span class="hljs-keyword">if</span> *telescopeFlag {
+ <span class="hljs-keyword">var</span> urls []<span class="hljs-keyword">string</span>
+ urls = parseFromTelescope()
+ <span class="hljs-keyword">if</span> *jflag {
+
+ checkURLJson(urls)
+ } <span class="hljs-keyword">else</span> {
+
+ fmt.Println()
+ fmt.Println(<span class="hljs-string">">> ***** UrlChecker is working now...... ***** <<"</span>)
+ fmt.Println(<span class="hljs-string">"--------------------------------------------------------------------------------------------------"</span>)
+ checkURL(urls)
+ }
+ }
+
<span class="hljs-comment">//use for loop to deal with multiple file paths</span>
i := <span class="hljs-number">1</span>
<span class="hljs-keyword">for</span> i+<span class="hljs-number">1</span> <= <span class="hljs-built_in">len</span>(os.Args) {
<span class="hljs-keyword">var</span> urls []<span class="hljs-keyword">string</span>
+
<span class="hljs-keyword">if</span> os.Args[i][<span class="hljs-number">0</span>] != <span class="hljs-string">'-'</span> {
<span class="hljs-comment">//call functions to check the availability of each url</span>
</code></pre>
<div>
<div>
Enter fullscreen mode
Exit fullscreen mode
</div>
</div>
</div><p> Hey PC readers,</p><p><br></p><p>Welcome back to my blog!</p><p>This week was after the study week and finally no more labs and PR every week. Phew! </p><p>Guess what! I had more 6-7 PRs and I completed Hacktoberfest! woohoo :D</p><p><br></p><p>Okay back to this week, I got too excited for Hacktoberfest completion. So this week was something new again. We talked about Telescope, I read on Slack and joined its channel for more info too. But never really worked on it. So this week, we had to do its setup and do a mini task with it. Well, not so mini, but yeah mini according to Telescope code lines. The setup was confusing at first, I got scared really when I read it the first time. There were so many technologies I never heard of. I read and closed it, whew, lots of readings to do first. But then fews days later, I opened it up again. Things made more sense this time. I started doing exactly what was told in the steps, WSL made the life easier. The setup was done, although I did came across an issue, here in the screenshot below it says to run the first command<code> sudo systemctl start docker</code></p><p> but that command was not working. I read a few blogs of the students who did it already and figured that doing the second command straight would do it. I did that and it worked. That was really helpful!</p><p></p><p></p><p><img src="" loading="lazy"> </p><p>Alright, rest everything worked fine. And I ran the Telescope server, it worked perfectly. I got my ten posts with its id and url on my 3000 port.</p><p><img src="" loading="lazy"> </p><p>Now all I had to do was to use my Link checker to get these urls instead of the ones I provided in the test.txt file that I gave manually.</p><p>I started and looked at my code, and realized that my code indeed was in good state and manageable that I did not have to make a lot of changes. The code was modularized too, so I didnt have to edit a lot of existing code. I created a new js file called restHandler where I added the code to receive these posts by using an endpoint which was </p><div><div>http://localhost:3000/posts</div></div><p>We would get our 10 posts from here and then I would save them to a new file called postsUrl.txt. The posts list was in Json format so I had to use JSON.stringify in order to use it in my file. I used axios for the fetching previously in my code and in this lab too. I did not make any changes to my fileHandler.js which was handling most of the previous labs code like clicolor, links status, status codes etc. All I added was the new option with </p><div><div>.option('-e, --endpoint <url>', 'URL to fetch post list');</div></div><p>My code was up and running. The gist can be found here for my code :<a href="https://gist.github.com/chawlapalak/15d026b30c8dd4801e714669f6566183"> https://gist.github.com/chawlapalak/15d026b30c8dd4801e714669f6566183</a></p><p><img src="" loading="lazy"> </p><p>See you in my next blog!</p><p>Best regards,</p><p>PC <br></p><p>This week was getting to know <a href="https://github.com/Seneca-CDOT/telescope">Telescope</a> week.</p><h4>Setting up Telescope</h4><p>I used mac to set up Telescope because I was aware that how painful it is to setup Telescope in Windows. Hence, I avoid it and I am glad that I did that. Because of that, it went smoothly. I installed docker which makes easier to run Telescope. So far, I did not encounter any major issues 🤞</p><h4>Lab 6</h4><p>The task of this lab was working with Telescope REST API. We have to grab the last 10 blog posts and check all the link inside each blogs. I could have just done it simpler way and be done with it, but I wanted to try something different and MULTIPLE ways. Yeah, I implemented this feature in 3 different ways.</p><h4>Version 1.0</h4><p>My first idea was using my tool inside Telescope project. To do that, first installed my tool inside Telescope. Without any change, my tool only works as CLI tool, not as a library. So, I searched how to call a CLI tool programmatically. There is a library called child-process which can be used to run any of my link chekcer’s available options. My link checker already has option called fbl -f fileName to read file and report links from the file. The coding I did to call the options with correct files were:</p><ul><li>grab the url form `http://localhost:3000/posts` which will give me an array of json object</li><li>loop through the array and fetch again `<a href="http://localhost:3000${post.url}">http://localhost:3000${post.url}</a>` to get each blog post</li><li>create temporary file with the content of the file: I used <a href="https://www.npmjs.com/package/tmp">tmp</a> library and fs.appendFile</li><li>call -f option where you pass the tmp file</li></ul><p>It took me a while with figuring out async call and how to call the options. That’s how I call the option from my link checker to Telescope.</p><pre>const { exec } = require('child_process');<br>const fbl = exec(`fbl -f ${fileName}`, function(error, stdout){...}</pre><figure><img src="https://cdn-images-1.medium.com/max/1024/1*INcb_tAyKRruh5q4K0fDkw.png" loading="lazy">Output of using fbl as CLI in Telescope</figure><h4>Version 1.1</h4><p>I was not fulfilled the way I did it because using as CLI required me to install another library. So, I decided to turn my tool into library. I thought it won’t take that long, but later I found out how wrong I was.</p><p>All my function was in index.js To make my tool as library, I need to organize it better and export the functions so that Telescope can access the functions too only by installing my tool.</p><p>I did a huge refactoring on my code. After making my code more modular, I implemented the main functionality of this lab in a separate function. The implementation was not difficult, but the Promise and async again gave me difficult time. It took me awhile to figure out how to add all the tmp files into an array which can be passed to readFile function.</p><p>To test it, I decided why not make an option on my tool too to use this feature. Now, if I run fbl -t , the output is</p><figure><img src="https://cdn-images-1.medium.com/max/1024/1*QFYaH695Xq7fAFu80gDqBA.png" loading="lazy">Output of running fbl -t</figure><h4>Version 1.2</h4><p>The second version made my tool as library. Now, I can use my tool from Telescope even more easily. All I did was literally 3 lines of coding and got the beautiful result:</p><pre>const fbl = require("findbrokenurl/src/util")<br>fbl.setDefaultConfig()<br>fbl.handleTelsescope()</pre><figure><img src="https://cdn-images-1.medium.com/max/1024/1*qcp5CPLCmqlqsnuPBqyxhw.png" loading="lazy">Output of using fbl as library in Telescope</figure><p>You can see the full codes for different versions in <a href="https://gist.github.com/sanji11/724bc5b24010ad9597ea9bb42dd27ca0">the gist file</a>. Do you think it was worth it?</p><img src="https://medium.com/_/stat?event=post.clientViewed&referrerSource=full_rss&postId=d40557f54604" loading="lazy"><p>Lab 6 was very cool! Also very annoying.</p><p>First off, cloning Telescope for my Windows Partition wasn’t hard, at all. I’m proficient with git, so that was easy. The annoying part was getting Docker ready for Windows.</p><p>First, I had to install Windows Subsystem for Linux. That wasn’t hard, it was fairly easy. So, with Linux installed and Docker installed, that must mean I’m ready, right? NOT SO FAST, EAGER MCBEAVER. I completely forgot that Docker only works for Version 2 of WSL, so I had to spend a while just waiting for Windows to convert it into WSL 2. Now… it’s time. I ran Docker and it generated this Document, detailing about the Telescope API, cool!</p><figure><img src="https://cdn-images-1.medium.com/max/1024/1*81rSzZQR4aBUdsPVH5vpjg.png" loading="lazy">The Generated File</figure><p>Well, now that I have everything ready, and Telescope is running, I made a simple addition to my Python script. Well.. does it work? Oh baby it does.</p><figure><img src="https://cdn-images-1.medium.com/max/865/1*1tU96Ghfr9s3ayohFdfejA.png" loading="lazy">Output from ConEmu</figure><p>To be honest, I enjoyed this Lab because I got to setup Docker for Windows. I felt a lot of emotions with WSL, but my goodness does it feel so good to get my code to work. This was a very spicy lab.</p><img src="https://medium.com/_/stat?event=post.clientViewed&referrerSource=full_rss&postId=4ca59ca6fe06" loading="lazy"><p>This week, we were introduced to the project our predecessors of the course established; <a href="https://github.com/Seneca-CDOT/telescope">telescope</a>. Telescope aggregates all of the students of DPS909s’ blogs into a single site. This is particularly helpful as many students choose different platforms for their blogs and some of those platforms such as <a href="https://rabautista.medium.com/">Medium</a> have a limit on how many of their articles you can access a day. Telescope helps overcome this hurdle and makes it much more convenient to read my fellow students’ blogs. We are after all quite used to mindlessly scrolling through social media, scrolling through <a href="https://telescope.cdot.systems/">Telescope</a> is a bit more enriching than a typical scroll through Instagram and the like(at least my Instagram feed which mostly consists of food, memes, and shenanigans of those particular few).</p><p>The lab for this week was to set up the telescope project on our own machines and use the <a href="http://localhost:3000/posts">http://localhost:3000/posts</a> running the API to retrieve the JSON for the most recent 10 posts on Telescope. In order to accomplish this, we had to install all the project dependencies before running the npm start command on the project. Setting up a project has such potential to be a rather onerous task; something compounded by a lack of documentation. Luckily, there was a <a href="https://github.com/Seneca-CDOT/telescope/blob/master/docs/CONTRIBUTING.md">CONTRIBUTING.md</a> for the project which made things a little easier. By following their <a href="https://github.com/Seneca-CDOT/telescope/blob/master/docs/environment-setup.md">environment setup instructions</a>, I was able to set up the project successfully. This was immensely helpful as I hadn’t heard of some of these technologies before. Admittedly, there was some confusion and I wasn’t entirely sure I was doing everything properly. However, after a couple of restarts and reading over the documentation for the technologies over, I was able to get port 3000 to run the API.</p><p>Next, we had to modify our <a href="https://github.com/rubyAnneB/linkReaper-cli">link checker project</a> to go through each of the blog posts returned by the API and search each of them for any dead links. I had a bit of experience working with APIs from the web class so I knew a little bit of what the flow of the program was going to be like. Since my project had an existing command (readwebsite) that reads a website and looks through the returned HTMl, it was mostly a matter of finding out how to read the JSON returned by the API and repurposing the readwebsite code to read through the blog posts.</p><p>While I was testing the code, I ran into a bit of a speed bump. With my fellow students writing about using <a href="http://localhost:3000/posts">http://localhost:3000/posts</a>, it was being picked up by linkReaper and tested. This had to do with the way I was searching through the files; through regex. I looked for a way to avoid picking up on the links and came across <a href="https://www.crummy.com/software/BeautifulSoup/bs4/doc/">BeautifulSoup</a> a python library for “pulling data out of HTML and XML files”. This did indeed pick up on the links in the HTML files. However, there was also the issue that some of the links in the href tags weren’t complete, For example, they were missing the protocol. This will require a bit of redesigning to figure out how to properly address the incomplete URLs. I’ve created an <a href="https://github.com/rubyAnneB/linkReaper-cli/issues/11">issue</a> so that this may be addressed in the future. In the meantime, the regex stays.</p><p>It was nice to be working in Python again, I’ve missed working on linkReaper. It was also interesting to be working with the Telescope project. It’s the biggest web project I’ve touched. It was interesting to see some familiar usernames and profile pics from the DPS909 slack sprinkled around the issues in the project. Since <a href="https://github.com/Seneca-CDOT/topics-in-open-source-2020/wiki/release-0.3">Release 0.3</a> has us all working on Telescope, I’ll be soon to join them.</p><img src="https://medium.com/_/stat?event=post.clientViewed&referrerSource=full_rss&postId=9239d77d3628" loading="lazy"><p>This week, we put our link checking program to use with telescope, an aggregated blogging site for all of the current and past OSD600 students dev blogs. Last week I blogged about working on the site by contributing to an issue. Our professor tasked us this week to check links for the 10 latest posts that appears in telescope using our link checking program from earlier this semester.</p><p></p><div><a href="https://1.bp.blogspot.com/-g21x_9koqvI/X6dcy3mpxEI/AAAAAAAAC64/sa1N3QPKfhIRlkvNYPXVkBU-ftYpm0iowCLcBGAsYHQ/s828/tele.PNG"><img src="https://1.bp.blogspot.com/-g21x_9koqvI/X6dcy3mpxEI/AAAAAAAAC64/sa1N3QPKfhIRlkvNYPXVkBU-ftYpm0iowCLcBGAsYHQ/w621-h151/tele.PNG" loading="lazy"></a></div><div><br></div><p></p><p>So unlike the other students that haven't worked on telescope yet, which you can read about 2 posts back, myself (and a few others I expect) have already got telescope setup and have long since beaten that huge struggle. I'd look at the slack chat and I see everyone running into problems left and right, and man do I really feel for them. Took me a while to get it running initially, but fortunately I can go straight to implementing the link checking feature in my program.</p><p>So I thought writing the feature would be fairly simple, and for the most part it was. First, using the api with the url 'localhost:3000/posts' I fetch a json array with the post ids for the first 10 posts. I had to query it by adding 'posts?per_page=10' in the end of the url to filter it down since it defaults to 30 posts. Then using the ids, I use fetch again to retrieve the page data, which I then pass on to my link checking function, there it retrieves the links through regex filtering, checks each one of them and outputs their results.</p><table><tbody><tr><td><a href="https://1.bp.blogspot.com/-VAI-g-DpQIg/X6dVyOw91dI/AAAAAAAAC6M/1xe9Rligj_Eg9_ipwRQ5NF_ujzRYVyDtQCLcBGAsYHQ/s908/telescopelinkcheckj.PNG"><img src="https://1.bp.blogspot.com/-VAI-g-DpQIg/X6dVyOw91dI/AAAAAAAAC6M/1xe9Rligj_Eg9_ipwRQ5NF_ujzRYVyDtQCLcBGAsYHQ/w599-h176/telescopelinkcheckj.PNG" loading="lazy"></a></td></tr><tr><td>The final code block</td></tr></tbody></table><p>However, I did run into a problem, and it was regarding some asynchronous code problems, which I've always been so confused about. And it led me down a java script asynchronous tutorial rabbit hole. I was trying to understand how the event loop works, how promises are queued and the order synchronous tasked are carried out, and I don't fully get it 100%, but I think I got it enough. My htmlVerify function uses a Promise.All to process fetching the links concurrently and outputs it. Well I tried console logging before calling htmlVerify, but the problem was that the console log line is synchronous and will process first before any of the links are outputted since promises are queued to the event loop. </p><p></p><div><a href="https://1.bp.blogspot.com/-T9Cs7N_gt28/X6dbedzT4HI/AAAAAAAAC6o/R0Q6ytFGbioxrVHQkpeDnQkrArJyd-TBwCLcBGAsYHQ/s903/image%2B%25281%2529.png"><img src="https://1.bp.blogspot.com/-T9Cs7N_gt28/X6dbedzT4HI/AAAAAAAAC6o/R0Q6ytFGbioxrVHQkpeDnQkrArJyd-TBwCLcBGAsYHQ/w530-h368/image%2B%25281%2529.png" loading="lazy"></a></div><div><br></div><br>So I fixed the problem by just console logging the post id as part of the output function that executes as part of the Promise.all chain. Then I got it working.<p></p><p><br></p><div><a href="https://1.bp.blogspot.com/-LnuUKZCc1_c/X6dZt4BugqI/AAAAAAAAC6Y/1tJOlZ8xD3g-YMD8h8-DnVP9rKOi9EL1gCLcBGAsYHQ/s670/telescopelinkcheck.gif"><img src="https://1.bp.blogspot.com/-LnuUKZCc1_c/X6dZt4BugqI/AAAAAAAAC6Y/1tJOlZ8xD3g-YMD8h8-DnVP9rKOi9EL1gCLcBGAsYHQ/w591-h355/telescopelinkcheck.gif" loading="lazy"></a></div><div><br></div><div>So then I generated a .diff file to post into gist to note the additions to my code and submitted my lab.</div><div><br></div><div>The gist file can be found here</div><div><br></div><div>[<a href="https://gist.github.com/joelazwar/0e14425eb3f6fc6e01e9ea9c8bdb34b3">https://gist.github.com/joelazwar/0e14425eb3f6fc6e01e9ea9c8bdb34b3</a>]</div><div><br></div><div><br></div><div>Thanks for reading</div><br><p></p>