<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>Forem: Ali Deishidi</title>
    <description>The latest articles on Forem by Ali Deishidi (@sizief).</description>
    <link>https://forem.com/sizief</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F161396%2Fd0e86c83-1d9a-4066-8a0b-c4796dbcb04f.jpeg</url>
      <title>Forem: Ali Deishidi</title>
      <link>https://forem.com/sizief</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://forem.com/feed/sizief"/>
    <language>en</language>
    <item>
      <title>Crawling Glassdoor </title>
      <dc:creator>Ali Deishidi</dc:creator>
      <pubDate>Sun, 28 Apr 2019 10:48:28 +0000</pubDate>
      <link>https://forem.com/sizief/crawling-glassdoor-5140</link>
      <guid>https://forem.com/sizief/crawling-glassdoor-5140</guid>
      <description>&lt;p&gt;&lt;a href="https://media.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fraw.githubusercontent.com%2Fsizief%2Fglassdoor-analyze%2Fmaster%2Fresult%2Ftotal-languages.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fraw.githubusercontent.com%2Fsizief%2Fglassdoor-analyze%2Fmaster%2Fresult%2Ftotal-languages.png" alt="header"&gt;&lt;/a&gt;&lt;br&gt;
So a week ago I read a post here about Ruby would kill Python in the future. There are lots of debates, as always, on the comment section of that post. Someone mentions that it does not matter who would kill whom, every language or tool is suitable for something; You have to pick right tool for right work.&lt;/p&gt;

&lt;p&gt;I think that is the right answer, however, it is important to consider that what market (or industry) thinks about tools and languages. Do they want a Ruby programmer as much as they want a Python developer? &lt;/p&gt;

&lt;p&gt;To find out, I created a small Ruby project which does these tasks:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Crawl Glass-door pages for predefined cities for software job ads&lt;/li&gt;
&lt;li&gt;Store pages&lt;/li&gt;
&lt;li&gt;Create anagram to count number of occurrences of each keyword (such as &lt;code&gt;Python&lt;/code&gt; or &lt;code&gt;Ruby&lt;/code&gt;
&lt;/li&gt;
&lt;li&gt;Generate &lt;code&gt;yml&lt;/code&gt; and &lt;code&gt;png&lt;/code&gt; files to visualize how industry is in need for each skill.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Here is the output for total ten cities around the world. Remember, this is the number of ads that contains each keyword. For example if ad contains &lt;code&gt;Java&lt;/code&gt; then it increase the number of total ads that contains this specific keyword. &lt;/p&gt;

&lt;p&gt;&lt;a href="https://media.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fgithub.com%2Fsizief%2Fglassdoor-analyze%2Fblob%2Fmaster%2Fresult%2Ftotal-languages.png%3Fraw%3Dtrue" class="article-body-image-wrapper"&gt;&lt;img src="https://media.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fgithub.com%2Fsizief%2Fglassdoor-analyze%2Fblob%2Fmaster%2Fresult%2Ftotal-languages.png%3Fraw%3Dtrue" alt="Software Languages"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;And here is the number of technologies mentions:&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fgithub.com%2Fsizief%2Fglassdoor-analyze%2Fblob%2Fmaster%2Fresult%2Ftotal-technologies.png%3Fraw%3Dtrue" class="article-body-image-wrapper"&gt;&lt;img src="https://media.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fgithub.com%2Fsizief%2Fglassdoor-analyze%2Fblob%2Fmaster%2Fresult%2Ftotal-technologies.png%3Fraw%3Dtrue" alt="Software Technologies"&gt;&lt;/a&gt;&lt;/p&gt;
&lt;h2&gt;
  
  
  So finally which language is in demand more than others?
&lt;/h2&gt;

&lt;p&gt;The answer is easy: &lt;code&gt;Java&lt;/code&gt;. But if you are looking for script languages, then the answer is &lt;code&gt;Python&lt;/code&gt;, &lt;code&gt;Javascript&lt;/code&gt; and then &lt;code&gt;Ruby&lt;/code&gt;. However there are interesting findings when you compare results for individual cities. &lt;/p&gt;

&lt;p&gt;For example this is an output for Amsterdam:&lt;br&gt;
&lt;a href="https://media.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fgithub.com%2Fsizief%2Fglassdoor-analyze%2Fblob%2Fmaster%2Fresult%2Famsterdam-languages.png%3Fraw%3Dtrue" class="article-body-image-wrapper"&gt;&lt;img src="https://media.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fgithub.com%2Fsizief%2Fglassdoor-analyze%2Fblob%2Fmaster%2Fresult%2Famsterdam-languages.png%3Fraw%3Dtrue" alt="Amsterdam stats"&gt;&lt;/a&gt;&lt;br&gt;
Notice anything unusual? yes &lt;code&gt;Scala&lt;/code&gt; is in demand even more than &lt;code&gt;Pyhton&lt;/code&gt; or &lt;code&gt;Javascript&lt;/code&gt;!&lt;/p&gt;

&lt;p&gt;See the rest of these reports &lt;a href="https://github.com/sizief/glassdoor-analyze/tree/master/result" rel="noopener noreferrer"&gt;here&lt;/a&gt;. It includes chart for &lt;code&gt;New york&lt;/code&gt;, &lt;code&gt;Berlin&lt;/code&gt;, &lt;code&gt;London&lt;/code&gt;, &lt;code&gt;Toronto&lt;/code&gt;, &lt;code&gt;Singapore&lt;/code&gt;, &lt;code&gt;Dubai&lt;/code&gt;, &lt;code&gt;Tallinn&lt;/code&gt; and etc.&lt;/p&gt;
&lt;h2&gt;
  
  
  About the project
&lt;/h2&gt;

&lt;p&gt;The structure is easy. First there is a &lt;a href="https://github.com/sizief/glassdoor-analyze/blob/master/config.yml" rel="noopener noreferrer"&gt;configuration file&lt;/a&gt; which you can define the cities, keywords, categories and etc.&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;urls:
  - Tallinn;https://www.glassdoor.ca/Job/tallinn;jobs
job_types:  
  - software
  - back-end
  - front-end
category:
  - languages
  - technologies
languages:
  - java
  - javascript 
  - c 
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;Then by running &lt;code&gt;client.rb&lt;/code&gt; it will get first URL from the configuration file, crawls the web page, saves all URL specific parameter for each listing page, gets the second page and repeats it until the last page.&lt;/p&gt;

&lt;p&gt;After that another class will crawl the web site again. This time it download the whole ad page and save it on disk.&lt;/p&gt;

&lt;p&gt;The third class then create an anagram of all predefined keywords and scans every document that we saved in previous step. We save the results as a &lt;code&gt;yml&lt;/code&gt; file then.&lt;/p&gt;

&lt;p&gt;Here is the sample of output:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;languages:
  java: 324
  javascript: 196
  c: 75
  c#: 140
  c++: 144
technologies:
  kafka: 41
  nosql: 60
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;At the end with the help of &lt;a href="https://github.com/topfunky/gruff" rel="noopener noreferrer"&gt;Gruff Gem&lt;/a&gt; we generate images from YAML files.&lt;/p&gt;

&lt;h2&gt;
  
  
  Side notes
&lt;/h2&gt;

&lt;ul&gt;
&lt;li&gt;This could be helpful if you are investigating your next career path or your next language to learn. Nothing serious more than that.&lt;/li&gt;
&lt;li&gt;The project is pretty much configurable. Just update the &lt;code&gt;config&lt;/code&gt; file: add what city you want, the first URL,  what keywords you are looking for and what categories. Then run it (wait minutes to get all the data) and check the output on result folder. &lt;a href="https://www.github.com/sizief/glassdoor-analyze" rel="noopener noreferrer"&gt;link to project&lt;/a&gt;
&lt;/li&gt;
&lt;li&gt;Have fun!&lt;/li&gt;
&lt;/ul&gt;

</description>
      <category>glassdoor</category>
      <category>jobs</category>
      <category>ads</category>
      <category>ruby</category>
    </item>
  </channel>
</rss>
