<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>Forem: Huy Bui</title>
    <description>The latest articles on Forem by Huy Bui (@williamhuybui).</description>
    <link>https://forem.com/williamhuybui</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F222002%2Fb2545637-a80c-4176-9bc5-242730582f56.jpg</url>
      <title>Forem: Huy Bui</title>
      <link>https://forem.com/williamhuybui</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://forem.com/feed/williamhuybui"/>
    <language>en</language>
    <item>
      <title>Datathon 2019</title>
      <dc:creator>Huy Bui</dc:creator>
      <pubDate>Sat, 09 Nov 2019 18:01:20 +0000</pubDate>
      <link>https://forem.com/williamhuybui/datathon-2019-2da2</link>
      <guid>https://forem.com/williamhuybui/datathon-2019-2da2</guid>
      <description>&lt;p&gt;I attended Datathon 2019 on October 20th, and it was so much fun.&lt;br&gt;
This year the event was hosted by Texas A&amp;amp;M University, where I had graduated just 3 months earlier.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://res.cloudinary.com/practicaldev/image/fetch/s--ZJhPdpIa--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_66%2Cw_880/https://thepracticaldev.s3.amazonaws.com/i/wg5j590ioujkkcqlx8y9.gif" class="article-body-image-wrapper"&gt;&lt;img src="https://res.cloudinary.com/practicaldev/image/fetch/s--ZJhPdpIa--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_66%2Cw_880/https://thepracticaldev.s3.amazonaws.com/i/wg5j590ioujkkcqlx8y9.gif" alt="Alt Text"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;For the first time, TAMU Datathon is banding together data enthusiasts from across multiple disciplines, skill levels, and universities. TAMU Datathon is a 24-hour non-stop coding event where companies, researchers, and hundreds of students immerse themselves in data science.&lt;/p&gt;

&lt;p&gt;This was probably the largest data science hackathon in the US. There were 2032 applications, 110 universities worldwide, and 15 company sponsors. Some of the big companies that sponsored the event were Facebook, ConocoPhillips, CBRE, Goldman Sachs, Shell, and Walmart.&lt;/p&gt;

&lt;p&gt;There were two tracks: learning and competition. The learning track was for beginners with little or no experience in data science. Attendees on this track could choose to attend a series of lectures or a number of different workshops created by global lead data scientists. On the competition track, participants could show off their programming skills, which could help them win prizes or earn an internship opportunity with a company. &lt;/p&gt;

&lt;p&gt;I chose the competition track because it sounded like so much fun. I joined the event Slack and found 3 friends who were also interested in competing. Together, we formed a team (4 math students) and worked non-stop on the problem. &lt;/p&gt;

&lt;p&gt;&lt;a href="https://res.cloudinary.com/practicaldev/image/fetch/s--RkRF9VuX--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_66%2Cw_880/https://thepracticaldev.s3.amazonaws.com/i/gtz005sn2vheardohog2.gif" class="article-body-image-wrapper"&gt;&lt;img src="https://res.cloudinary.com/practicaldev/image/fetch/s--RkRF9VuX--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_66%2Cw_880/https://thepracticaldev.s3.amazonaws.com/i/gtz005sn2vheardohog2.gif" alt="Alt Text"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;When I first arrived, I was overwhelmed by the size of the event. Here are some pictures showing the activity in the Recreational Center conference room throughout the day.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://res.cloudinary.com/practicaldev/image/fetch/s--Jc1Kx6di--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://thepracticaldev.s3.amazonaws.com/i/s8vkl9voyldp8ejedua3.jpg" class="article-body-image-wrapper"&gt;&lt;img src="https://res.cloudinary.com/practicaldev/image/fetch/s--Jc1Kx6di--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://thepracticaldev.s3.amazonaws.com/i/s8vkl9voyldp8ejedua3.jpg" alt="At the beginning"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;and night...&lt;/p&gt;

&lt;p&gt;&lt;a href="https://res.cloudinary.com/practicaldev/image/fetch/s--cyCpPX_F--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://thepracticaldev.s3.amazonaws.com/i/j2u2lpnifq4vdb1iqtoz.jpg" class="article-body-image-wrapper"&gt;&lt;img src="https://res.cloudinary.com/practicaldev/image/fetch/s--cyCpPX_F--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://thepracticaldev.s3.amazonaws.com/i/j2u2lpnifq4vdb1iqtoz.jpg" alt="At night"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;h1&gt;
  
  
  Challenge
&lt;/h1&gt;

&lt;p&gt;Every company brought a challenge, and the competitors could freely choose which one they wanted to do. There were three problems that I found entertaining:&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Walmart:&lt;/strong&gt; use image processing techniques to detect the availability of products on the shelves. This is mainly to deal with Black Friday!&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;ConocoPhillips:&lt;/strong&gt; given time series data from about 150 sensors on a drilling rig, predict when the machine will malfunction.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Facebook:&lt;/strong&gt; create a mini Google Maps for the Dodoma region, given satellite images and the locations of residential houses.&lt;/p&gt;

&lt;p&gt;I ended up choosing the Facebook challenge because of its complexity. The data consisted of 64 high-resolution pictures describing the road network of a region of approximately 30 km x 30 km in the vicinity of Dodoma, the capital of Tanzania.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://res.cloudinary.com/practicaldev/image/fetch/s--U8R8lTHs--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://thepracticaldev.s3.amazonaws.com/i/aprc4s8rne7sjpd60tv9.png" class="article-body-image-wrapper"&gt;&lt;img src="https://res.cloudinary.com/practicaldev/image/fetch/s--U8R8lTHs--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://thepracticaldev.s3.amazonaws.com/i/aprc4s8rne7sjpd60tv9.png" alt="Alt Text"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;A sample of 4 images &lt;br&gt;
&lt;a href="https://res.cloudinary.com/practicaldev/image/fetch/s--l8isoPO5--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://thepracticaldev.s3.amazonaws.com/i/zfpkqinb9hddrab8afd5.png" class="article-body-image-wrapper"&gt;&lt;img src="https://res.cloudinary.com/practicaldev/image/fetch/s--l8isoPO5--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://thepracticaldev.s3.amazonaws.com/i/zfpkqinb9hddrab8afd5.png" alt="Smaller Map"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;Putting 64 images together we have the whole map&lt;br&gt;
&lt;a href="https://res.cloudinary.com/practicaldev/image/fetch/s--kb_PXN1O--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://thepracticaldev.s3.amazonaws.com/i/a7e3xrfi29ba5nqqknjj.png" class="article-body-image-wrapper"&gt;&lt;img src="https://res.cloudinary.com/practicaldev/image/fetch/s--kb_PXN1O--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://thepracticaldev.s3.amazonaws.com/i/a7e3xrfi29ba5nqqknjj.png" alt="Whole Map"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;Besides the pictures, there was a folder containing 64 .csv files with details about &lt;strong&gt;each pixel&lt;/strong&gt; of the map and its coordinates! Each pixel represents approximately 0.5 meters square in real space. &lt;/p&gt;

&lt;p&gt;&lt;a href="https://res.cloudinary.com/practicaldev/image/fetch/s--RIol8bD8--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://thepracticaldev.s3.amazonaws.com/i/mrlvkkhk7fsexoo1jod6.png" class="article-body-image-wrapper"&gt;&lt;img src="https://res.cloudinary.com/practicaldev/image/fetch/s--RIol8bD8--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://thepracticaldev.s3.amazonaws.com/i/mrlvkkhk7fsexoo1jod6.png" alt="Map Data"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;Another .csv file consists of population features&lt;/p&gt;

&lt;p&gt;&lt;a href="https://res.cloudinary.com/practicaldev/image/fetch/s--MbcTcSRt--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://thepracticaldev.s3.amazonaws.com/i/q87i1ks784zhisyx0f0e.png" class="article-body-image-wrapper"&gt;&lt;img src="https://res.cloudinary.com/practicaldev/image/fetch/s--MbcTcSRt--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://thepracticaldev.s3.amazonaws.com/i/q87i1ks784zhisyx0f0e.png" alt="Population Data"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;The questions were: &lt;br&gt;
1) Find the closest street to each residential location&lt;br&gt;
2) Find the shortest path from one place to another.&lt;/p&gt;

&lt;p&gt;The data was two gigabytes, and solving the second question required hours of training neural networks. It was really hard to work with big data and deliver a result in just one day. In fact, some of us did not sleep at all. &lt;/p&gt;

&lt;h1&gt;
  
  
  Presentation
&lt;/h1&gt;

&lt;p&gt;The presentations started at 2 p.m. the following day. The host set up a big timer on the projector. Every time it hit the 4-minute mark, all judges moved to the next table to grade a different project. The best part of the presentation was that I could talk to Facebook data scientists and listen to their valuable feedback. &lt;/p&gt;

&lt;p&gt;&lt;a href="https://res.cloudinary.com/practicaldev/image/fetch/s--6VNisXMJ--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://thepracticaldev.s3.amazonaws.com/i/kc9l8oggovx9iyupdvsr.jpg" class="article-body-image-wrapper"&gt;&lt;img src="https://res.cloudinary.com/practicaldev/image/fetch/s--6VNisXMJ--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://thepracticaldev.s3.amazonaws.com/i/kc9l8oggovx9iyupdvsr.jpg" alt="The Presentation"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;The picture of me and my team presenting to a Facebook judge&lt;br&gt;
&lt;a href="https://res.cloudinary.com/practicaldev/image/fetch/s--vAYmehiq--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://thepracticaldev.s3.amazonaws.com/i/vyi2k8dr6upnb35qiew7.jpg" class="article-body-image-wrapper"&gt;&lt;img src="https://res.cloudinary.com/practicaldev/image/fetch/s--vAYmehiq--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://thepracticaldev.s3.amazonaws.com/i/vyi2k8dr6upnb35qiew7.jpg" alt="Alt Text"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;After a tiring day, we took a group picture together. We were just strangers the day before, and now we had become much closer. This was because we tried so hard to &lt;strong&gt;complete our project&lt;/strong&gt;. I was excited to come back home and share my experience with everyone in my data science bootcamp and, later, the &lt;strong&gt;dev.to&lt;/strong&gt; community.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://res.cloudinary.com/practicaldev/image/fetch/s--Z97uGdKX--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://thepracticaldev.s3.amazonaws.com/i/53yhn0weqavrzpk6qr2p.jpg" class="article-body-image-wrapper"&gt;&lt;img src="https://res.cloudinary.com/practicaldev/image/fetch/s--Z97uGdKX--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://thepracticaldev.s3.amazonaws.com/i/53yhn0weqavrzpk6qr2p.jpg" alt="My Team"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;For more pictures of the event, please visit &lt;a href="https://photos.google.com/share/AF1QipNNI34583oNfSJTFBSKMGqTz7DkC2ZykoxBKSSbGYrYucjQ4XAictgnNZ1r1oYBMA?key=UkdvcXhGU2RjR3l5eWZqdnBmWXlVeUoyUnZIMXRB"&gt;here&lt;/a&gt;. &lt;/p&gt;

&lt;p&gt;&lt;a href="https://res.cloudinary.com/practicaldev/image/fetch/s--dlEOF0UV--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_66%2Cw_880/https://thepracticaldev.s3.amazonaws.com/i/h0q1pbq1w5kyywr68xzk.gif" class="article-body-image-wrapper"&gt;&lt;img src="https://res.cloudinary.com/practicaldev/image/fetch/s--dlEOF0UV--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_66%2Cw_880/https://thepracticaldev.s3.amazonaws.com/i/h0q1pbq1w5kyywr68xzk.gif" alt="Alt Text"&gt;&lt;/a&gt;&lt;/p&gt;

</description>
      <category>datascience</category>
      <category>python</category>
      <category>computerscience</category>
      <category>be</category>
    </item>
    <item>
      <title>Monty Hall Problem</title>
      <dc:creator>Huy Bui</dc:creator>
      <pubDate>Fri, 27 Sep 2019 19:28:50 +0000</pubDate>
      <link>https://forem.com/williamhuybui/monty-hall-problem-4oae</link>
      <guid>https://forem.com/williamhuybui/monty-hall-problem-4oae</guid>
      <description>&lt;p&gt;The &lt;strong&gt;Monty Hall Problem&lt;/strong&gt; is a famous probability puzzle in statistics. It is named after Monty Hall, the host of the television game show "Let's Make a Deal". The brain teaser loosely replicates the game show concept and it goes like this:&lt;/p&gt;

&lt;p&gt;&lt;a href="https://res.cloudinary.com/practicaldev/image/fetch/s--HUHJXhRr--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://thepracticaldev.s3.amazonaws.com/i/dppd3jokklf3jbhb1sk0.png" class="article-body-image-wrapper"&gt;&lt;img src="https://res.cloudinary.com/practicaldev/image/fetch/s--HUHJXhRr--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://thepracticaldev.s3.amazonaws.com/i/dppd3jokklf3jbhb1sk0.png" alt="Alt Text"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;There are 3 doors. You will have to choose a door, and you will win whatever is behind it. There is one door with a car. Each remaining door has a goat. First, you are asked to pick one of the doors. Next, Monty, who knows what's behind each of the doors, opens one of the two doors you didn’t pick and reveals a goat. Finally, you are given the opportunity to either “stay” with your original choice, or “switch” to the remaining door. What is the best strategy to win the car? &lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;I first encountered this problem 4 years ago in a statistics class. It was hard enough that I spent the whole day thinking about it and still insisted on 1/2 for both strategies. My explanation was: there are 3 doors, and 1 door has a car. If the host opens one door that has a goat, one of the two remaining doors must have the car. So regardless of "switching" or "staying", the probability should be 1/2!   &lt;/p&gt;

&lt;p&gt;I was wrong. The tricky part of this problem is that Monty knows what is behind the doors, so his decision influences the outcome. Take a minute to think about whether this is true.&lt;/p&gt;

&lt;h2&gt;
  
  
  1) Intuitive answer
&lt;/h2&gt;

&lt;p&gt;Assume that the three doors are 1, 2, and 3.&lt;br&gt;
Since probability = (number of favorable outcomes)/(size of the sample space), we just need to find these two quantities and plug them into the equation.&lt;br&gt;
The sample space here is the set of different combinations of the door we choose and where the car actually is. There are 6 possibilities, which are illustrated in the following table:&lt;/p&gt;

&lt;p&gt;&lt;a href="https://res.cloudinary.com/practicaldev/image/fetch/s--zOms1_zT--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://thepracticaldev.s3.amazonaws.com/i/fpeijsqol07658x42ow6.png" class="article-body-image-wrapper"&gt;&lt;img src="https://res.cloudinary.com/practicaldev/image/fetch/s--zOms1_zT--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://thepracticaldev.s3.amazonaws.com/i/fpeijsqol07658x42ow6.png" alt="Alt Text"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;Note that in this table, Monty has already eliminated the door with the goat. &lt;br&gt;
As you can see, the probability of winning for "staying" strategy is 1/3 while "switching" is 2/3.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://res.cloudinary.com/practicaldev/image/fetch/s--_cIpY8LG--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_66%2Cw_880/https://thepracticaldev.s3.amazonaws.com/i/2izk7doe64ai0ndyzo6w.gif" class="article-body-image-wrapper"&gt;&lt;img src="https://res.cloudinary.com/practicaldev/image/fetch/s--_cIpY8LG--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_66%2Cw_880/https://thepracticaldev.s3.amazonaws.com/i/2izk7doe64ai0ndyzo6w.gif" alt="Alt Text"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;h2&gt;
  
  
  2) Bayes formula method
&lt;/h2&gt;

&lt;p&gt;This is another approach for people who love Bayes! &lt;br&gt;
Let P(A|B) denote the probability of event A given that event B has already happened. &lt;/p&gt;

&lt;p&gt;&lt;a href="https://res.cloudinary.com/practicaldev/image/fetch/s--d7drKCZl--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://thepracticaldev.s3.amazonaws.com/i/jpcf6pe6q8bsy5dysusl.png" class="article-body-image-wrapper"&gt;&lt;img src="https://res.cloudinary.com/practicaldev/image/fetch/s--d7drKCZl--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://thepracticaldev.s3.amazonaws.com/i/jpcf6pe6q8bsy5dysusl.png" alt="Alt Text"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;If we know how to translate the problem into events, then everything becomes straightforward. Let:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;A be the event that your first choice is the door with the car. &lt;/li&gt;
&lt;li&gt;B be the event that Monty eliminates one door that has a goat. 
So we are looking for P(A|B), the probability that the "staying" strategy wins.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Let's write down what we know: &lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;P(B|A) = 1, since Monty always opens a goat door, whether or not your first pick is the car.&lt;/li&gt;
&lt;li&gt;P(A) = 1/3, since without any extra information there is a 1/3 chance that your first pick is the car door. &lt;/li&gt;
&lt;li&gt;
To understand how to find P(B), let's look at the diagram:
&lt;img src="https://res.cloudinary.com/practicaldev/image/fetch/s--Gj3i8JDj--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://thepracticaldev.s3.amazonaws.com/i/44rb00z0z8ybzekknog0.png" alt="Alt Text"&gt;
&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;a href="https://res.cloudinary.com/practicaldev/image/fetch/s--msxfAtmA--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://thepracticaldev.s3.amazonaws.com/i/t11nr5ugso3dqxm5j4az.png" class="article-body-image-wrapper"&gt;&lt;img src="https://res.cloudinary.com/practicaldev/image/fetch/s--msxfAtmA--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://thepracticaldev.s3.amazonaws.com/i/t11nr5ugso3dqxm5j4az.png" alt="Alt Text"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;There are only 2 possible strategies, either stay or switch. Thus, "switching" probability is the complement of staying, which is 1-1/3=2/3.&lt;/p&gt;

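&lt;p&gt;&lt;strong&gt;If&lt;/strong&gt; Monty does not know which door has the goat and simply opens one of the other two doors at random, B becomes the event that the opened door happens to reveal a goat. This is equivalent to P(B|A)=1 and P(B|¬A)=1/2. Plugging these back into the equation gives P(A|B)=1/2. This is the case that I had confused with the original problem. &lt;/p&gt;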
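
&lt;p&gt;For readers who cannot see the images, the two Bayes calculations can be written out as follows. This is a reconstruction from the values stated in the text (P(A)=1/3, P(B|A)=1, and P(B|¬A) equal to 1 when Monty knows where the car is or 1/2 when he opens a door at random), not a transcription of the original figures.&lt;/p&gt;

```latex
\[
P(A \mid B) = \frac{P(B \mid A)\,P(A)}{P(B \mid A)\,P(A) + P(B \mid \neg A)\,P(\neg A)}
\]
\[
\text{Monty knows: } P(A \mid B)
  = \frac{1 \cdot \tfrac{1}{3}}{1 \cdot \tfrac{1}{3} + 1 \cdot \tfrac{2}{3}}
  = \frac{1}{3}
\]
\[
\text{Monty opens at random: } P(A \mid B)
  = \frac{1 \cdot \tfrac{1}{3}}{1 \cdot \tfrac{1}{3} + \tfrac{1}{2} \cdot \tfrac{2}{3}}
  = \frac{1}{2}
\]
```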

&lt;h2&gt;
  
  
  3) Simpler answer
&lt;/h2&gt;

&lt;p&gt;With the "staying" strategy, you can only win if you choose the door with the car at first. Since there are 3 doors, the probability of choosing the door with the car is 1/3. So the "switching" strategy wins with probability 2/3. This gives us a general observation that extends to the Monty Hall problem with more than 3 doors. &lt;/p&gt;
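
&lt;p&gt;The argument above can be checked by brute force. The snippet below is a minimal sketch, not taken from the original post: it enumerates every equally likely combination of first pick and car position for 3 doors, assumes Monty always opens a goat door, and counts how often each strategy wins. The variable names are only illustrative.&lt;/p&gt;

```python
from fractions import Fraction
from itertools import product

doors = [1, 2, 3]
cases = list(product(doors, doors))  # every (first pick, car position) pair

stay_wins = 0
switch_wins = 0
for pick, car in cases:
    # Monty opens a goat door: neither your pick nor the car.
    opened = next(d for d in doors if d != pick and d != car)
    # Switching means taking the one door that is still closed and unpicked.
    other = next(d for d in doors if d != pick and d != opened)
    stay_wins += (pick == car)
    switch_wins += (other == car)

p_stay = Fraction(stay_wins, len(cases))
p_switch = Fraction(switch_wins, len(cases))
print(p_stay, p_switch)  # 1/3 2/3
```

&lt;p&gt;When the first pick is the car, Monty has two goat doors to choose from; the sketch always takes the first one, which does not change the win counts.&lt;/p&gt;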

&lt;h2&gt;
  
  
  4) What if?
&lt;/h2&gt;

&lt;p&gt;What if Monty Hall decides to reveal k doors out of n doors where the maximum number of k is n-2, will it affect the strategy at all?&lt;/p&gt;

&lt;p&gt;&lt;a href="https://res.cloudinary.com/practicaldev/image/fetch/s--yFcKLXJ0--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://thepracticaldev.s3.amazonaws.com/i/xs7yzx0m7cvotz7868kf.jpg" class="article-body-image-wrapper"&gt;&lt;img src="https://res.cloudinary.com/practicaldev/image/fetch/s--yFcKLXJ0--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://thepracticaldev.s3.amazonaws.com/i/xs7yzx0m7cvotz7868kf.jpg" alt="Alt Text"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;"Switching" in this case means not keeping the original choice&lt;/p&gt;

&lt;h4&gt;
  
  
  Case A: k=n-2
&lt;/h4&gt;

&lt;p&gt;This case is similar to the original problem. Note that you cannot switch to the door that you chose. So "Staying" win rate is always 1/n and switching win rate, in this case, is (n-1)/n&lt;/p&gt;

&lt;h4&gt;
  
  
  Case B: k&amp;lt;n-2
&lt;/h4&gt;

&lt;p&gt;The probability of choosing the door with a car at the beginning is still 1/n. Thus "staying" = 1/n.&lt;br&gt;
In the "switching" strategy, your final choice cannot be your first choice, so you can only win if your first door did not have the car. The probability of that is (n-1)/n. After Monty reveals k doors, there are n-k door(s) left, but you must exclude your first door, which leaves n-k-1 doors to switch to. Thus, the probability of picking the prize door in the last step is 1/(n-k-1). Together we have: &lt;/p&gt;

&lt;p&gt;&lt;a href="https://res.cloudinary.com/practicaldev/image/fetch/s--sRPtj4KP--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://thepracticaldev.s3.amazonaws.com/i/iu2vgqi7fofsskwhgg0k.png" class="article-body-image-wrapper"&gt;&lt;img src="https://res.cloudinary.com/practicaldev/image/fetch/s--sRPtj4KP--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://thepracticaldev.s3.amazonaws.com/i/iu2vgqi7fofsskwhgg0k.png" alt="Alt Text"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;It shows that the "switching" strategy is always better than "staying" strategy.&lt;/p&gt;
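
&lt;p&gt;As a quick check, the two win probabilities derived in the text, 1/n for "staying" and (n-1)/n times 1/(n-k-1) for "switching", can be evaluated exactly with fractions. This is only a sketch; the function names &lt;code&gt;stay_win&lt;/code&gt; and &lt;code&gt;switch_win&lt;/code&gt; are made up for illustration.&lt;/p&gt;

```python
from fractions import Fraction


def stay_win(n):
    """Chance that the first pick out of n doors is the car."""
    return Fraction(1, n)


def switch_win(n, k):
    """Chance of winning by switching after Monty reveals k goat doors.

    (n-1)/n: the first pick was a goat.
    1/(n-k-1): the car is the door chosen among the remaining closed
    doors, excluding the first pick.
    """
    return Fraction(n - 1, n) * Fraction(1, n - k - 1)


print(stay_win(3), switch_win(3, 1))    # 1/3 2/3
print(stay_win(10), switch_win(10, 3))  # 1/10 3/20
```

&lt;p&gt;The ratio of the two is (n-1)/(n-k-1), which is larger than 1 as soon as Monty reveals at least one door, so switching wins more often in every case.&lt;/p&gt;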

&lt;h2&gt;
  
  
  5) Simulation
&lt;/h2&gt;

&lt;p&gt;Let's run it in Python to see if our formula works. Let n be the number of doors and r be the number of doors Monty reveals.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://res.cloudinary.com/practicaldev/image/fetch/s--SDVjtVgK--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://thepracticaldev.s3.amazonaws.com/i/9ohcnr66xuc4nnu14670.jpg" class="article-body-image-wrapper"&gt;&lt;img src="https://res.cloudinary.com/practicaldev/image/fetch/s--SDVjtVgK--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://thepracticaldev.s3.amazonaws.com/i/9ohcnr66xuc4nnu14670.jpg" alt="Alt Text"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;The graph shows that the simulated result gets closer to our theoretical answer as the number of iterations increases. &lt;/p&gt;
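
&lt;p&gt;The code behind the graph is only shown as an image, so here is a minimal, independent sketch of such a simulation. It is not the code from the repository. It assumes that Monty reveals r goat doors that are neither your pick nor the car, and that "switching" means picking uniformly at random among the remaining closed doors other than your first pick. The names &lt;code&gt;play&lt;/code&gt; and &lt;code&gt;win_rate&lt;/code&gt; are hypothetical.&lt;/p&gt;

```python
import random


def play(n, r, switch, rng):
    """Play one game with n doors and r revealed goats; return True on a win."""
    doors = list(range(n))
    car = rng.choice(doors)
    pick = rng.choice(doors)
    # Monty may only open goat doors that the player did not pick.
    goats = [d for d in doors if d != pick and d != car]
    opened = set(rng.sample(goats, r))
    if not switch:
        return pick == car
    # Switch to a random closed door that is not the first pick.
    options = [d for d in doors if d != pick and d not in opened]
    return rng.choice(options) == car


def win_rate(n, r, switch, trials=20000, seed=0):
    """Estimate the win probability of a strategy by repeated play."""
    rng = random.Random(seed)
    wins = sum(play(n, r, switch, rng) for _ in range(trials))
    return wins / trials


print(win_rate(3, 1, switch=False))  # close to 1/3
print(win_rate(3, 1, switch=True))   # close to 2/3
```

&lt;p&gt;Raising &lt;code&gt;trials&lt;/code&gt; should bring the estimates closer to the formulas above, which is the behaviour the graph describes.&lt;/p&gt;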

&lt;p&gt;So now you know what to do the next time you play a game like this.&lt;/p&gt;




&lt;p&gt;For more information about the code, please visit my &lt;a href="https://github.com/williamhuybui/Monty_Hall_problem"&gt;GitHub repository&lt;/a&gt;.&lt;/p&gt;

</description>
      <category>statistics</category>
      <category>bayesian</category>
      <category>datascience</category>
      <category>python</category>
    </item>
    <item>
      <title>Blockbuster Movie Trends</title>
      <dc:creator>Huy Bui</dc:creator>
      <pubDate>Fri, 20 Sep 2019 00:40:55 +0000</pubDate>
      <link>https://forem.com/williamhuybui/blockbuster-movie-trends-h94</link>
      <guid>https://forem.com/williamhuybui/blockbuster-movie-trends-h94</guid>
      <description>&lt;p&gt;Imagine that you are the CEO of a trillion-dollar company like Microsoft. One night, after watching a movie at the AMC with your girlfriend, you think about entering the Hollywood industry and making a lot of money for your company. Before doing that, you need a little insight into the current movie market. Consequently, you consult a data scientist, and here is what he has:&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fthepracticaldev.s3.amazonaws.com%2Fi%2F1gxlp3iu6c87lr5os59a.gif" class="article-body-image-wrapper"&gt;&lt;img src="https://media.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fthepracticaldev.s3.amazonaws.com%2Fi%2F1gxlp3iu6c87lr5os59a.gif" alt="Alt Text"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;h2&gt;
  
  
  Data
&lt;/h2&gt;

&lt;p&gt;The objective of this project is to find the trend of popular movies in the U.S. This blog will explain the data through visualization that requires no technical knowledge to understand.  However, a &lt;a href="https://github.com/williamhuybui/Blockbuster-Movie-Trends" rel="noopener noreferrer"&gt;GitHub repository&lt;/a&gt; is provided for those interested in the technical details of this project in regards to Data Science.  &lt;/p&gt;

&lt;p&gt;The data come from the 220 top-grossing movies from 2009 to 2019, obtained through API requests to &lt;a href="https://www.themoviedb.org/" rel="noopener noreferrer"&gt;The Movie Database&lt;/a&gt;. Also, a list of the 1000 best actors is collected from &lt;a href="https://imdb.com/" rel="noopener noreferrer"&gt;IMDb&lt;/a&gt;.&lt;br&gt;
The analysis is written in Python and presented with Tableau.&lt;/p&gt;

&lt;h2&gt;
  
  
  Question
&lt;/h2&gt;

&lt;h3&gt;
  
  
  1) What is the average profit and rating?
&lt;/h3&gt;

&lt;p&gt;&lt;a href="https://media.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fthepracticaldev.s3.amazonaws.com%2Fi%2Fw6c2mj0uib920cws23fv.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fthepracticaldev.s3.amazonaws.com%2Fi%2Fw6c2mj0uib920cws23fv.png" alt="Alt Text"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;The average rating is 6.845, ranging from 4.8 (Independence Day) to 8.3 (Inception).  &lt;/p&gt;

&lt;p&gt;Avengers: Endgame unsurprisingly made more money than any movie in history, with $2.4b in profit. The lowest comes from Dark Phoenix, with $45m. The average profit of these top 220 movies is $538m.&lt;/p&gt;

&lt;h3&gt;
  
  
  2) Which MPAA rating attracts the most audience?
&lt;/h3&gt;

&lt;p&gt;&lt;a href="https://media.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fthepracticaldev.s3.amazonaws.com%2Fi%2F58s1nkobme8rz3j20nd1.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fthepracticaldev.s3.amazonaws.com%2Fi%2F58s1nkobme8rz3j20nd1.png" alt="Alt Text"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;Teen-friendly &lt;strong&gt;PG-13&lt;/strong&gt; movies top the chart of Motion Picture Association of America (MPAA) ratings, with over 50% of the top-grossing movie market. The reason is simple: this rating reaches the widest audience.&lt;/p&gt;

&lt;h3&gt;
  
  
  3) When is the best time to release a movie?
&lt;/h3&gt;

&lt;p&gt;&lt;a href="https://media.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fthepracticaldev.s3.amazonaws.com%2Fi%2Ffhafcldstr34i7fk6ci3.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fthepracticaldev.s3.amazonaws.com%2Fi%2Ffhafcldstr34i7fk6ci3.png" alt="Alt Text"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;The best months at the box office are:&lt;br&gt;
Summer blockbusters: &lt;strong&gt;May, June, July&lt;/strong&gt;.&lt;br&gt;
Thanksgiving and Christmas time: &lt;strong&gt;November, December&lt;/strong&gt;.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fthepracticaldev.s3.amazonaws.com%2Fi%2F875pkc2nw3yfjyobr5in.gif" class="article-body-image-wrapper"&gt;&lt;img src="https://media.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fthepracticaldev.s3.amazonaws.com%2Fi%2F875pkc2nw3yfjyobr5in.gif" alt="Alt Text"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;h3&gt;
  
  
  4) Best genres to invest
&lt;/h3&gt;

&lt;p&gt;&lt;a href="https://media.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fthepracticaldev.s3.amazonaws.com%2Fi%2F7xtbfltjz32i9cu12i4a.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fthepracticaldev.s3.amazonaws.com%2Fi%2F7xtbfltjz32i9cu12i4a.png" alt="Alt Text"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;No 1: &lt;strong&gt;Action-Adventure&lt;/strong&gt; such as Wonder Woman, Captain America, The Hunger Games.&lt;br&gt;
No 2: &lt;strong&gt;Action-Fantasy&lt;/strong&gt; such as Lord of the Rings, Maleficent, Star Wars.&lt;br&gt;
No 3: &lt;strong&gt;Animation-Family&lt;/strong&gt; such as Lion King, Inside Out, Wall-E.&lt;/p&gt;

&lt;h3&gt;
  
  
  5) Profit expectancy
&lt;/h3&gt;

&lt;p&gt;&lt;a href="https://media.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fthepracticaldev.s3.amazonaws.com%2Fi%2F9zw26ioiqjnbftpm8peu.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fthepracticaldev.s3.amazonaws.com%2Fi%2F9zw26ioiqjnbftpm8peu.png" alt="Alt Text"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;The linear regression line shows the correlation between your profit and investment with the formula: &lt;strong&gt;profit=1.55*budget+3.08*10^8&lt;/strong&gt;.&lt;/p&gt;

&lt;p&gt;For example, let's assume that every step, from marketing, production, and leadership to choosing the right studio, is nearly perfect. If your company invests $100m, the formula above predicts about $463m in profit!&lt;/p&gt;
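
&lt;p&gt;The trend line can be turned into a tiny calculator. This is a sketch that only uses the two coefficients quoted above (slope 1.55 and intercept 3.08*10^8, both in US dollars); the function name &lt;code&gt;expected_profit&lt;/code&gt; is made up for illustration, and the result is a point on a regression line, not a guarantee. Plugging in a $100m budget gives roughly $463m.&lt;/p&gt;

```python
SLOPE = 1.55        # extra profit in dollars per dollar of budget
INTERCEPT = 3.08e8  # baseline profit in dollars from the fitted line


def expected_profit(budget_usd):
    """Profit predicted by the trend line for a given budget in dollars."""
    return SLOPE * budget_usd + INTERCEPT


budget = 100e6  # a $100m production budget
print(round(expected_profit(budget) / 1e6))  # 463 (millions of dollars)
```

&lt;p&gt;Because the fit comes from only the top-grossing movies, the large intercept mostly reflects that selection and should not be read as what an average film earns.&lt;/p&gt;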

&lt;p&gt;It is also worth mentioning some of the special cases:&lt;br&gt;
&lt;strong&gt;Avatar&lt;/strong&gt; is the most profitable movie of all time, where profitability is derived from the profit/budget ratio. One reason is that Avatar was among the first blockbusters to showcase &lt;strong&gt;modern 3D&lt;/strong&gt; in movie theaters.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Avengers: Endgame&lt;/strong&gt; is the next outlier. The sequel is the culmination of 22 movies released over 11 years!&lt;/p&gt;

&lt;p&gt;Therefore, new technology or careful sequel planning is the game changer! &lt;/p&gt;

&lt;p&gt;Justice League is on the other side of the spectrum. The movie falls at the bottom of this list due to the failure of its leadership. More about it &lt;a href="https://www.forbes.com/sites/markhughes/2018/03/23/justice-league-was-a-sad-failure-in-studio-leadership/#703a215e5e5d" rel="noopener noreferrer"&gt;here&lt;/a&gt;.&lt;/p&gt;

&lt;h3&gt;
  
  
  6) The association of genres and profit
&lt;/h3&gt;

&lt;p&gt;&lt;a href="https://media.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fthepracticaldev.s3.amazonaws.com%2Fi%2Fh3mbqwty4547eno4sq2e.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fthepracticaldev.s3.amazonaws.com%2Fi%2Fh3mbqwty4547eno4sq2e.png" alt="Alt Text"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;Take a closer look at the top three genres. The trend lines show that if the budget is:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Less than $125m&lt;/strong&gt;, you should invest in &lt;strong&gt;Animation&lt;/strong&gt;.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Between $125m and $182m&lt;/strong&gt;, you should invest in &lt;strong&gt;Action&lt;/strong&gt;.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;More than $182m&lt;/strong&gt;, you should invest in &lt;strong&gt;Adventure&lt;/strong&gt;.&lt;/li&gt;
&lt;/ul&gt;

&lt;h3&gt;
  
  
  7) How many good actors should we hire?
&lt;/h3&gt;

&lt;p&gt;&lt;a href="https://media.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fthepracticaldev.s3.amazonaws.com%2Fi%2Fqv06967s6qtft963lgy3.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fthepracticaldev.s3.amazonaws.com%2Fi%2Fqv06967s6qtft963lgy3.png" alt="Alt Text"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;The casting is a big factor in our calculation. Hiring popular actors can significantly increase the anticipation/attention of the audience. A stunning moment of Gal Gadot could go viral on the internet for a few weeks.&lt;br&gt;
&lt;a href="https://media.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fthepracticaldev.s3.amazonaws.com%2Fi%2F9c7kq5qy5sza2u14p8il.gif" class="article-body-image-wrapper"&gt;&lt;img src="https://media.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fthepracticaldev.s3.amazonaws.com%2Fi%2F9c7kq5qy5sza2u14p8il.gif" alt="Alt Text"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;We also want to hire actors who can take the audience on an emotional roller coaster. The list of the 1000 best actors captures all of these qualities. Some of the people on the list are Jack Nicholson, Leonardo DiCaprio, and Morgan Freeman.&lt;/p&gt;

&lt;p&gt;The graph shows that action, adventure, and family movies have an average of &lt;strong&gt;5 A-rank actors&lt;/strong&gt;. Interestingly, history movies have the most, with 7 actors on average. That said, visual effects can sometimes overshadow acting skill.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fthepracticaldev.s3.amazonaws.com%2Fi%2Fcvnp5b7h3wvn1qi572rn.gif" class="article-body-image-wrapper"&gt;&lt;img src="https://media.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fthepracticaldev.s3.amazonaws.com%2Fi%2Fcvnp5b7h3wvn1qi572rn.gif" alt="Alt Text"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;h2&gt;
  
  
  Summary:
&lt;/h2&gt;

&lt;ul&gt;
&lt;li&gt;The top 3 genres for a high-grossing film are &lt;strong&gt;Action, Animation, and Adventure&lt;/strong&gt;, with Action ranking highest. The profitability also &lt;strong&gt;depends on the budget range&lt;/strong&gt;. &lt;/li&gt;
&lt;li&gt;Release movies during the &lt;strong&gt;summer months&lt;/strong&gt; and before and during &lt;strong&gt;winter break&lt;/strong&gt;. &lt;/li&gt;
&lt;li&gt;Hire at least &lt;strong&gt;5 movie stars&lt;/strong&gt;.&lt;/li&gt;
&lt;li&gt;Rate it &lt;strong&gt;PG-13&lt;/strong&gt;.&lt;/li&gt;
&lt;/ul&gt;

</description>
      <category>python</category>
      <category>datascience</category>
      <category>sql</category>
      <category>database</category>
    </item>
    <item>
      <title>First Post: Transition from Mathematics to Data Science</title>
      <dc:creator>Huy Bui</dc:creator>
      <pubDate>Thu, 05 Sep 2019 14:05:56 +0000</pubDate>
      <link>https://forem.com/williamhuybui/first-post-transition-from-mathematics-to-data-science-12cl</link>
      <guid>https://forem.com/williamhuybui/first-post-transition-from-mathematics-to-data-science-12cl</guid>
      <description>&lt;p&gt;This post is a personal story about how I "quit" math and found data science. I do not intend to encourage anyone to do the same. A lot of factors influenced my choice, so please be careful. &lt;br&gt;
&lt;a href="https://media.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fthepracticaldev.s3.amazonaws.com%2Fi%2Fdf05rnfgor7zzhhf3pjt.gif" class="article-body-image-wrapper"&gt;&lt;img src="https://media.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fthepracticaldev.s3.amazonaws.com%2Fi%2Fdf05rnfgor7zzhhf3pjt.gif" alt="Alt Text"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;h2&gt;
  
  
  Why not Math?
&lt;/h2&gt;

&lt;p&gt;Three months ago, I was still a Ph.D. student in Mathematics. I was struggling with the decision to drop out of the program. The process of making that decision was a two-year nightmare. Galois theory, expander graphs, chaos theory, the discharging method... They are all interesting topics that I spent a lot of time studying on my own. But the more I studied, the more I pushed myself away from reality. I woke up every morning feeling completely lost. I will explain my internal conflict with an example from set theory. The Continuum Hypothesis states that:&lt;/p&gt;

&lt;p&gt;&lt;code&gt;There does not exist a set whose size is strictly greater than that of the natural numbers and strictly less than that of the reals. - Georg Cantor&lt;/code&gt;&lt;/p&gt;

&lt;p&gt;If we can find a 1-1 correspondence between two sets, then they have the same size (cardinality). For example, the natural numbers, the integers, and the rational numbers all have the same cardinality, which is infinite. Is there anything bigger than infinity? &lt;/p&gt;

&lt;p&gt;&lt;a href="https://media.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fthepracticaldev.s3.amazonaws.com%2Fi%2Fge720gciwfuuzpdc5vyv.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fthepracticaldev.s3.amazonaws.com%2Fi%2Fge720gciwfuuzpdc5vyv.png" alt="Alt Text"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;The answer is yes! There are sets, such as the set of real numbers, for which it is impossible to find a 1-1 correspondence with the set of natural numbers. You can see why by investigating the countability of each set: the set of natural numbers is countable, while the set of real numbers is not. The Continuum Hypothesis asks what, if anything, truly lies in between the natural numbers and the real numbers.&lt;/p&gt;
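
&lt;p&gt;To make the idea of a 1-1 correspondence concrete, here is a minimal Python sketch that pairs the natural numbers with the integers. The function name and the choice to start the natural numbers at 0 are assumptions made for this illustration.&lt;/p&gt;

```python
def natural_to_integer(n):
    """Map the naturals 0, 1, 2, 3, 4, ... to 0, -1, 1, -2, 2, ..."""
    half = (n + 1) // 2
    # Even naturals go to the non-negative integers, odd ones to the negatives.
    return half if n % 2 == 0 else -half


k = 5
images = [natural_to_integer(n) for n in range(2 * k + 1)]

# Every integer from -k to k is hit exactly once by the first 2k + 1 naturals.
print(sorted(images) == list(range(-k, k + 1)))
```

&lt;p&gt;The same check passes for any k, which is the sense in which the two sets have the same size. No such pairing exists between the natural numbers and the real numbers.&lt;/p&gt;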

&lt;p&gt;The study of different types of infinity is very interesting. However, the results have little practical use for society. Why do some people (set theorists) spend their whole lives studying infinity? I can't answer this question myself. I respect mathematicians in general for their unspoken contributions to science and art. Nonetheless, my mentality is not built for processing this 9-to-5. This truth took me a long time to learn.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fthepracticaldev.s3.amazonaws.com%2Fi%2F7f2ymdaxwqdnq5tzyysa.gif" class="article-body-image-wrapper"&gt;&lt;img src="https://media.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fthepracticaldev.s3.amazonaws.com%2Fi%2F7f2ymdaxwqdnq5tzyysa.gif" alt="Alt Text"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;One important point is that my family lives under the poverty line. They always support my career decisions and I am grateful for that. But as the eldest son in the family, I still feel seriously bad for not being able to provide. Graduate-level pure math is a form of art that takes seven years to master. Yet a Ph.D. in pure math is hard to place in most industries, and academia is highly competitive. Thus, it was not sensible for me to continue.&lt;/p&gt;

&lt;h2&gt;
  
  
  Let's say that family was not an issue. Would you continue?
&lt;/h2&gt;

&lt;p&gt;I started the Ph.D. program in math with full funding. I imagined myself finishing the program with my head held high to a standing ovation. However, a Ph.D. in math takes more than just liking the subject. It requires a lot of commitment, sacrifice, and mental capacity. I was not ready for it at all. I came to the battle arena with just a wooden sword.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fthepracticaldev.s3.amazonaws.com%2Fi%2Fd4qqr74eoqi70wm1mz0b.gif" class="article-body-image-wrapper"&gt;&lt;img src="https://media.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fthepracticaldev.s3.amazonaws.com%2Fi%2Fd4qqr74eoqi70wm1mz0b.gif" alt="Alt Text"&gt;&lt;/a&gt; &lt;/p&gt;

&lt;p&gt;In the very first month of grad school, I suffered from &lt;strong&gt;&lt;em&gt;imposter syndrome&lt;/em&gt;&lt;/strong&gt;. My school was one of the top in the nation in abstract algebra. I was freaked out by my classmates because they always "quietly nodded their heads" whenever the teacher wrote, both literally and metaphorically, &lt;strong&gt;Greek&lt;/strong&gt; on the board. I worked my butt off to get better, but the progress was hardly noticeable. I was a big fish in a small pond before I came, but here I was nobody in a sea of sharks and whales. By the end of the first year, I had an existential crisis. I was so depressed that I sent apology letters to some of my undergrad professors, telling them that I was a failure and that I did not live up to their expectations. Luckily, I managed to pass all the classes and maintain good standing in the program. However, I felt tired and my head was cloudy all the time. I knew that something was not right, but I was too afraid to think about it. &lt;/p&gt;

&lt;h2&gt;
  
  
  What makes me choose data science?
&lt;/h2&gt;

&lt;p&gt;After the trauma, I reevaluated my goals in life. I tried to convince myself that there were opportunities outside academia. However, I felt ashamed of not being able to finish what I started. One day, I saw a post from a fellow Ph.D. that changed my mind: &lt;br&gt;
&lt;a href="https://media.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fthepracticaldev.s3.amazonaws.com%2Fi%2F37y6vomih02kn5ki0h94.jpg" class="article-body-image-wrapper"&gt;&lt;img src="https://media.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fthepracticaldev.s3.amazonaws.com%2Fi%2F37y6vomih02kn5ki0h94.jpg" alt="Pic"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;The advice soothed the wound. A few days later, I talked to the department chair about my situation. This was four months before my graduation.&lt;/p&gt;

&lt;p&gt;I looked for a job that required a Master's degree in a quantitative field, and Data Science caught my attention. I knew a little bit of Python, combinatorics, and graph theory. I just hoped that my time here would not be wasted. &lt;/p&gt;

&lt;h2&gt;
  
  
  How do I feel now?
&lt;/h2&gt;

&lt;p&gt;I came back home after graduation and now live with my parents and three younger siblings. It took me a while to adjust my daily routine. I had been gone for two years and a lot of things had changed. My brother had grown a lot. He struggles with an anxiety disorder and has given my parents a headache many times. I just wish that I could have been here sooner to be his brother. I have also had a lot of my mom's food, which is like heaven compared with what I ate back then. &lt;/p&gt;

&lt;p&gt;I started a Data Science program at Flatiron a week ago. I had to pay a lot of money to get in, but I feel good about it so far. The program is challenging enough to be fun. There is a lot of logical thinking in programming, which is similar to math. Although the curriculum is condensed and laborious, the staff are very helpful. They just graduated from the program themselves, so they understand how we feel and give a lot of input. The students here come from different backgrounds: oil and gas, geology, IT, teaching... We have one thing in common: we are all trying hard to advance our careers.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fthepracticaldev.s3.amazonaws.com%2Fi%2Fqp54f3ulxptugeooetp9.gif" class="article-body-image-wrapper"&gt;&lt;img src="https://media.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fthepracticaldev.s3.amazonaws.com%2Fi%2Fqp54f3ulxptugeooetp9.gif" alt="Alt Text"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;h2&gt;
  
  
  Ending notes
&lt;/h2&gt;

&lt;p&gt;I have never regretted my two years in graduate school. I made a bunch of smart friends and experienced college life. I also learned a lot about myself through hardship.&lt;/p&gt;

&lt;p&gt;My piece of advice for friends out there who are still finding their passion: &lt;strong&gt;&lt;em&gt;It's hard to change, but it's worth the try!&lt;/em&gt;&lt;/strong&gt;&lt;/p&gt;

</description>
      <category>math</category>
      <category>datascience</category>
      <category>carrerchange</category>
      <category>transition</category>
    </item>
  </channel>
</rss>
