<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>Forem: Purva Masurkar</title>
    <description>The latest articles on Forem by Purva Masurkar (@purvanova213).</description>
    <link>https://forem.com/purvanova213</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F931671%2F70b2180d-6eca-4b2a-b559-3c6e270cb85a.jpeg</url>
      <title>Forem: Purva Masurkar</title>
      <link>https://forem.com/purvanova213</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://forem.com/feed/purvanova213"/>
    <language>en</language>
    <item>
      <title>Day 5-7 of 100 Days of ML Code: Data Manipulation with Pandas</title>
      <dc:creator>Purva Masurkar</dc:creator>
      <pubDate>Fri, 17 Mar 2023 17:20:43 +0000</pubDate>
      <link>https://forem.com/purvanova213/day-5-7-of-100-days-of-ml-code-data-manipulation-with-pandas-3a3k</link>
      <guid>https://forem.com/purvanova213/day-5-7-of-100-days-of-ml-code-data-manipulation-with-pandas-3a3k</guid>
      <description>&lt;blockquote&gt;
&lt;p&gt;"Embrace the struggle, trust the process, and persist with effort."&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;To start using the Pandas library, we need to import it into our Python environment. For this example, I downloaded a Shop Customer Data dataset from Kaggle that provides a detailed analysis of an imaginary shop's ideal customers. To read the CSV file into our Pandas DataFrame, we can use the pd.read_csv() function.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Indexing&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Indexing in Pandas refers to the process of selecting specific rows and/or columns from a Pandas DataFrame or Series. There are several ways to perform indexing in Pandas:&lt;/p&gt;

&lt;p&gt;iloc(): This method allows you to select rows and columns by integer location.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://res.cloudinary.com/practicaldev/image/fetch/s--KoaQjjWp--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://dev-to-uploads.s3.amazonaws.com/uploads/articles/3xhwnd6ffd6my01usg5c.png" class="article-body-image-wrapper"&gt;&lt;img src="https://res.cloudinary.com/practicaldev/image/fetch/s--KoaQjjWp--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://dev-to-uploads.s3.amazonaws.com/uploads/articles/3xhwnd6ffd6my01usg5c.png" alt="Image description" width="397" height="576"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;loc(): This method allows you to select rows and columns by label or boolean mask. &lt;/p&gt;

&lt;p&gt;&lt;a href="https://res.cloudinary.com/practicaldev/image/fetch/s--9fLXoYuh--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://dev-to-uploads.s3.amazonaws.com/uploads/articles/t93ij5eh7vgge19qmerq.png" class="article-body-image-wrapper"&gt;&lt;img src="https://res.cloudinary.com/practicaldev/image/fetch/s--9fLXoYuh--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://dev-to-uploads.s3.amazonaws.com/uploads/articles/t93ij5eh7vgge19qmerq.png" alt="Image description" width="627" height="572"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;Boolean Indexing: This method allows you to filter a DataFrame by a boolean condition.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://res.cloudinary.com/practicaldev/image/fetch/s--TimjuH66--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://dev-to-uploads.s3.amazonaws.com/uploads/articles/s3rqqzkpww0yihgcw1z7.png" class="article-body-image-wrapper"&gt;&lt;img src="https://res.cloudinary.com/practicaldev/image/fetch/s--TimjuH66--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://dev-to-uploads.s3.amazonaws.com/uploads/articles/s3rqqzkpww0yihgcw1z7.png" alt="Image description" width="880" height="434"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Filtering&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Filtering in Pandas refers to the process of selecting a subset of data from a DataFrame based on certain conditions.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://res.cloudinary.com/practicaldev/image/fetch/s--AaZkqpME--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://dev-to-uploads.s3.amazonaws.com/uploads/articles/udih16yuwkpq5i2xnams.png" class="article-body-image-wrapper"&gt;&lt;img src="https://res.cloudinary.com/practicaldev/image/fetch/s--AaZkqpME--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://dev-to-uploads.s3.amazonaws.com/uploads/articles/udih16yuwkpq5i2xnams.png" alt="Image description" width="880" height="454"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;a href="https://res.cloudinary.com/practicaldev/image/fetch/s--XPiktV5d--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://dev-to-uploads.s3.amazonaws.com/uploads/articles/tao7xu2cqqrv0rdifzx2.png" class="article-body-image-wrapper"&gt;&lt;img src="https://res.cloudinary.com/practicaldev/image/fetch/s--XPiktV5d--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://dev-to-uploads.s3.amazonaws.com/uploads/articles/tao7xu2cqqrv0rdifzx2.png" alt="Image description" width="880" height="456"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Updating Rows and Columns&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Updating rows and columns in a Pandas DataFrame involves changing the values of specific cells, rows, or columns.&lt;/p&gt;

&lt;p&gt;Updating a specific cell:&lt;/p&gt;

&lt;p&gt;&lt;a href="https://res.cloudinary.com/practicaldev/image/fetch/s--dU97jBRV--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://dev-to-uploads.s3.amazonaws.com/uploads/articles/h1x440pmorey0dg9binc.png" class="article-body-image-wrapper"&gt;&lt;img src="https://res.cloudinary.com/practicaldev/image/fetch/s--dU97jBRV--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://dev-to-uploads.s3.amazonaws.com/uploads/articles/h1x440pmorey0dg9binc.png" alt="Image description" width="350" height="645"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;Updating a specific row:&lt;/p&gt;

&lt;p&gt;&lt;a href="https://res.cloudinary.com/practicaldev/image/fetch/s--at0virBB--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://dev-to-uploads.s3.amazonaws.com/uploads/articles/215p9a58k3ie14vq5257.png" class="article-body-image-wrapper"&gt;&lt;img src="https://res.cloudinary.com/practicaldev/image/fetch/s--at0virBB--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://dev-to-uploads.s3.amazonaws.com/uploads/articles/215p9a58k3ie14vq5257.png" alt="Image description" width="880" height="242"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;Updating a specific column:&lt;/p&gt;

&lt;p&gt;&lt;a href="https://res.cloudinary.com/practicaldev/image/fetch/s--xApoGUcX--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://dev-to-uploads.s3.amazonaws.com/uploads/articles/9bteqwr44bmlniuzwod8.png" class="article-body-image-wrapper"&gt;&lt;img src="https://res.cloudinary.com/practicaldev/image/fetch/s--xApoGUcX--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://dev-to-uploads.s3.amazonaws.com/uploads/articles/9bteqwr44bmlniuzwod8.png" alt="Image description" width="417" height="327"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;em&gt;In Pandas, there are two methods along with filter that are commonly used for transforming data:&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;apply: applies a function to a DataFrame or a Series along an axis. &lt;/p&gt;

&lt;p&gt;&lt;a href="https://res.cloudinary.com/practicaldev/image/fetch/s--Ng4dTEHn--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://dev-to-uploads.s3.amazonaws.com/uploads/articles/69ix2zlo0zbzasg9teac.png" class="article-body-image-wrapper"&gt;&lt;img src="https://res.cloudinary.com/practicaldev/image/fetch/s--Ng4dTEHn--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://dev-to-uploads.s3.amazonaws.com/uploads/articles/69ix2zlo0zbzasg9teac.png" alt="Image description" width="537" height="517"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;map: applies a function to each element of a Series. &lt;/p&gt;

&lt;p&gt;&lt;a href="https://res.cloudinary.com/practicaldev/image/fetch/s--Q9x0foJB--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://dev-to-uploads.s3.amazonaws.com/uploads/articles/v1n57xt8zek4h1igftm9.png" class="article-body-image-wrapper"&gt;&lt;img src="https://res.cloudinary.com/practicaldev/image/fetch/s--Q9x0foJB--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://dev-to-uploads.s3.amazonaws.com/uploads/articles/v1n57xt8zek4h1igftm9.png" alt="Image description" width="356" height="283"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Adding/ Removing Rows and Columns&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;append(): You can use this method to add one or more rows to an existing DataFrame. &lt;/p&gt;

&lt;p&gt;&lt;a href="https://res.cloudinary.com/practicaldev/image/fetch/s--ZjDmxK0q--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://dev-to-uploads.s3.amazonaws.com/uploads/articles/yz80se1y2jalrr54hydm.png" class="article-body-image-wrapper"&gt;&lt;img src="https://res.cloudinary.com/practicaldev/image/fetch/s--ZjDmxK0q--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://dev-to-uploads.s3.amazonaws.com/uploads/articles/yz80se1y2jalrr54hydm.png" alt="Image description" width="450" height="413"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;drop(): You can use this method to remove one or more rows from a DataFrame.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://res.cloudinary.com/practicaldev/image/fetch/s--TqLZq63P--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://dev-to-uploads.s3.amazonaws.com/uploads/articles/lsuo3tg5tk3i8w7hznug.png" class="article-body-image-wrapper"&gt;&lt;img src="https://res.cloudinary.com/practicaldev/image/fetch/s--TqLZq63P--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://dev-to-uploads.s3.amazonaws.com/uploads/articles/lsuo3tg5tk3i8w7hznug.png" alt="Image description" width="293" height="578"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Sorting Data&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Sorting data is an important operation in data analysis. In Pandas, you can sort data using the sort_values() method.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://res.cloudinary.com/practicaldev/image/fetch/s--5D8LV5WB--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://dev-to-uploads.s3.amazonaws.com/uploads/articles/hsstz53softo4v661mxe.png" class="article-body-image-wrapper"&gt;&lt;img src="https://res.cloudinary.com/practicaldev/image/fetch/s--5D8LV5WB--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://dev-to-uploads.s3.amazonaws.com/uploads/articles/hsstz53softo4v661mxe.png" alt="Image description" width="448" height="627"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;a href="https://res.cloudinary.com/practicaldev/image/fetch/s--UuQWoXQn--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://dev-to-uploads.s3.amazonaws.com/uploads/articles/ouaukvkq3id2midhoi9e.png" class="article-body-image-wrapper"&gt;&lt;img src="https://res.cloudinary.com/practicaldev/image/fetch/s--UuQWoXQn--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://dev-to-uploads.s3.amazonaws.com/uploads/articles/ouaukvkq3id2midhoi9e.png" alt="Image description" width="407" height="308"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Grouping and Aggregating Data&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Grouping and aggregating data is a common operation in data analysis. In Pandas, you can group data using the groupby() method and aggregate it using the agg() method.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://res.cloudinary.com/practicaldev/image/fetch/s--IfxOkBfl--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://dev-to-uploads.s3.amazonaws.com/uploads/articles/782655vwmgl6v47zwmi6.png" class="article-body-image-wrapper"&gt;&lt;img src="https://res.cloudinary.com/practicaldev/image/fetch/s--IfxOkBfl--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://dev-to-uploads.s3.amazonaws.com/uploads/articles/782655vwmgl6v47zwmi6.png" alt="Image description" width="571" height="282"&gt;&lt;/a&gt;&lt;/p&gt;

</description>
      <category>machinelearning</category>
      <category>codenewbie</category>
      <category>womenintech</category>
      <category>100daysofcode</category>
    </item>
    <item>
      <title>Day 4 of 100 Days of ML Code: Pandas Library</title>
      <dc:creator>Purva Masurkar</dc:creator>
      <pubDate>Mon, 13 Mar 2023 18:35:25 +0000</pubDate>
      <link>https://forem.com/purvanova213/day-4-of-100-days-of-ml-code-pandas-library-2221</link>
      <guid>https://forem.com/purvanova213/day-4-of-100-days-of-ml-code-pandas-library-2221</guid>
      <description>&lt;blockquote&gt;
&lt;p&gt;"Programs must be written for people to read, and only incidentally for machines to execute." - Harold Abelson and Gerald Jay Sussman, Structure and Interpretation of Computer Programs&lt;/p&gt;
&lt;/blockquote&gt;

&lt;h2&gt;
  
  
  Introduction to Pandas
&lt;/h2&gt;

&lt;p&gt;Pandas is an open-source Python library that is widely used for data manipulation and analysis.&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Summarizes the data.&lt;/li&gt;
&lt;li&gt;Read and write different formats of file like CSV, JSON, EXCEL, HTML etc.&lt;/li&gt;
&lt;li&gt;We can filter and modify the data based on multiple conditions.&lt;/li&gt;
&lt;li&gt;We can merge multiple files.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;Difference between Attributes and Methods&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Attributes are used to represent properties or state of an object, while methods are used to represent behaviors or operations on its data. Attributes are accessed using the dot notation without parentheses, while methods are called using the dot notation with parentheses and optional arguments.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Importing Pandas&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;To use the Pandas library in Python, we first need to import it into our code. There are different ways to import Pandas, but the most common one is using the import statement&lt;/p&gt;

&lt;p&gt;&lt;a href="https://res.cloudinary.com/practicaldev/image/fetch/s--T51nQsvM--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://dev-to-uploads.s3.amazonaws.com/uploads/articles/bljbuztcfk1f8u7bdbhe.png" class="article-body-image-wrapper"&gt;&lt;img src="https://res.cloudinary.com/practicaldev/image/fetch/s--T51nQsvM--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://dev-to-uploads.s3.amazonaws.com/uploads/articles/bljbuztcfk1f8u7bdbhe.png" alt="Image description" width="880" height="244"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;This statement imports the entire Pandas library, and we can access its functions and classes using the pd namespace.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Reading and Viewing the csv file&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;To work with real-world data, I have selected the Stack Overflow Annual Developer Survey file, which is a widely used dataset for data analysis and machine learning. This dataset contains information about the demographics, education, employment, and technology preferences of software developers from different parts of the world. The survey is conducted annually by Stack Overflow, a popular Q&amp;amp;A website for programmers.&lt;/p&gt;

&lt;p&gt;To read a CSV file using Pandas, we use the pd.read_csv() function.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://res.cloudinary.com/practicaldev/image/fetch/s--MR3twClr--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://dev-to-uploads.s3.amazonaws.com/uploads/articles/0akacqrgg4aiiqqbz9xc.png" class="article-body-image-wrapper"&gt;&lt;img src="https://res.cloudinary.com/practicaldev/image/fetch/s--MR3twClr--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://dev-to-uploads.s3.amazonaws.com/uploads/articles/0akacqrgg4aiiqqbz9xc.png" alt="Image description" width="880" height="313"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;df.head(n): Displays the first n rows of the DataFrame (by default, n=5).&lt;/li&gt;
&lt;li&gt;df.tail(n): Displays the last n rows of the DataFrame (by default, n=5).&lt;/li&gt;
&lt;li&gt;df.shape: Returns a tuple containing the number of rows and columns in the DataFrame.&lt;/li&gt;
&lt;li&gt;df.columns: Returns a list of column names in the DataFrame.&lt;/li&gt;
&lt;li&gt;df.dtypes: Returns the data type of each column in the DataFrame.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;a href="https://res.cloudinary.com/practicaldev/image/fetch/s--Zkw59Dwl--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://dev-to-uploads.s3.amazonaws.com/uploads/articles/n5lx2eyzxffkipakr5ax.png" class="article-body-image-wrapper"&gt;&lt;img src="https://res.cloudinary.com/practicaldev/image/fetch/s--Zkw59Dwl--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://dev-to-uploads.s3.amazonaws.com/uploads/articles/n5lx2eyzxffkipakr5ax.png" alt="Image description" width="880" height="379"&gt;&lt;/a&gt;&lt;br&gt;
&lt;a href="https://res.cloudinary.com/practicaldev/image/fetch/s--3j06GA_e--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://dev-to-uploads.s3.amazonaws.com/uploads/articles/eij57q0gztl4gmmgw5df.png" class="article-body-image-wrapper"&gt;&lt;img src="https://res.cloudinary.com/practicaldev/image/fetch/s--3j06GA_e--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://dev-to-uploads.s3.amazonaws.com/uploads/articles/eij57q0gztl4gmmgw5df.png" alt="Image description" width="826" height="731"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;To check null values in data we use. This function counts the total number of missing data from columns and sums them up.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://res.cloudinary.com/practicaldev/image/fetch/s--9J1X-kz9--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://dev-to-uploads.s3.amazonaws.com/uploads/articles/i8m3feh2lsqah4atyb69.png" class="article-body-image-wrapper"&gt;&lt;img src="https://res.cloudinary.com/practicaldev/image/fetch/s--9J1X-kz9--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://dev-to-uploads.s3.amazonaws.com/uploads/articles/i8m3feh2lsqah4atyb69.png" alt="Image description" width="381" height="653"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;To give summary of the data we use. It only includes columns that are numerical and not strings.&lt;br&gt;
&lt;a href="https://res.cloudinary.com/practicaldev/image/fetch/s--Qr9mdNFO--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://dev-to-uploads.s3.amazonaws.com/uploads/articles/k7ygrfqedewncq0487xl.png" class="article-body-image-wrapper"&gt;&lt;img src="https://res.cloudinary.com/practicaldev/image/fetch/s--Qr9mdNFO--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://dev-to-uploads.s3.amazonaws.com/uploads/articles/k7ygrfqedewncq0487xl.png" alt="Image description" width="551" height="417"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;Gives all information of column such as number of rows, missing value, data types.&lt;br&gt;
&lt;a href="https://res.cloudinary.com/practicaldev/image/fetch/s--gSAIDT79--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://dev-to-uploads.s3.amazonaws.com/uploads/articles/qeppvwjvxrv6p7p9lwgs.png" class="article-body-image-wrapper"&gt;&lt;img src="https://res.cloudinary.com/practicaldev/image/fetch/s--gSAIDT79--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://dev-to-uploads.s3.amazonaws.com/uploads/articles/qeppvwjvxrv6p7p9lwgs.png" alt="Image description" width="582" height="570"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;We are not allowed to see all columns so we use this function&lt;br&gt;
&lt;a href="https://res.cloudinary.com/practicaldev/image/fetch/s--ih7lbsPi--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://dev-to-uploads.s3.amazonaws.com/uploads/articles/b2tx3tbwylalvjabd0wb.png" class="article-body-image-wrapper"&gt;&lt;img src="https://res.cloudinary.com/practicaldev/image/fetch/s--ih7lbsPi--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://dev-to-uploads.s3.amazonaws.com/uploads/articles/b2tx3tbwylalvjabd0wb.png" alt="Image description" width="880" height="286"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;DataFrame&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;In Pandas, a DataFrame is a two-dimensional table-like data structure that consists of rows and columns. Once created, a DataFrame can be manipulated, transformed, and analyzed using various Pandas functions and methods.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://res.cloudinary.com/practicaldev/image/fetch/s--00k_KVFu--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://dev-to-uploads.s3.amazonaws.com/uploads/articles/xu9tlan98q68d6qg2luv.png" class="article-body-image-wrapper"&gt;&lt;img src="https://res.cloudinary.com/practicaldev/image/fetch/s--00k_KVFu--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://dev-to-uploads.s3.amazonaws.com/uploads/articles/xu9tlan98q68d6qg2luv.png" alt="Image description" width="442" height="685"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;iloc and loc are two methods in Pandas that allows to select subsets of rows and columns from a DataFrame based on their index or label values. iloc is used for integer-based indexing, while loc is used for label-based indexing.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://res.cloudinary.com/practicaldev/image/fetch/s--tWMgJVmP--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://dev-to-uploads.s3.amazonaws.com/uploads/articles/drirphb52jyemy13fzg9.png" class="article-body-image-wrapper"&gt;&lt;img src="https://res.cloudinary.com/practicaldev/image/fetch/s--tWMgJVmP--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://dev-to-uploads.s3.amazonaws.com/uploads/articles/drirphb52jyemy13fzg9.png" alt="Image description" width="310" height="711"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Conclusion&lt;/strong&gt;&lt;br&gt;
I am interested in continuing my exploration of the Pandas library because there is a lot to learn from it that can be helpful for my future applications. I will continue listing my daily progress and try to remain consistent. Please do share your feedback on how I can my 100daysofcode challenge more productive. I'll see you tomorrow for my daily update.&lt;/p&gt;

</description>
      <category>machinelearning</category>
      <category>100daysofcode</category>
      <category>womenintech</category>
      <category>codenewbie</category>
    </item>
    <item>
      <title>Day 3 of 100 Days of ML Code: NumPy</title>
      <dc:creator>Purva Masurkar</dc:creator>
      <pubDate>Sun, 12 Mar 2023 17:38:02 +0000</pubDate>
      <link>https://forem.com/purvanova213/day-3-of-100-days-of-ml-code-numpy-5558</link>
      <guid>https://forem.com/purvanova213/day-3-of-100-days-of-ml-code-numpy-5558</guid>
      <description>&lt;blockquote&gt;
&lt;p&gt;"Success is not final, failure is not fatal: it is the courage to continue that counts." - Winston Churchill&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;On the third day of my #100daysofcode challenge, I learned NumPy (Numerical Python) a Python library that is used for scientific computing and data analysis. I referred various resources for this &lt;a href="https://courses.analyticsvidhya.com/courses/take/Machine-Learning-Certification-Course-for-Beginners/lessons/26302998-basics-of-numpy-in-python"&gt;Baiscs of NumPy&lt;/a&gt; and &lt;a href="https://youtu.be/QUT1VHiLmmI"&gt;NumPy in One hour by freeCodeCamp&lt;/a&gt;.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Introduction to NumPy&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;NumPy is the fundamental package for scientific computing with python. It contains many other things:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;N-dimensional array object&lt;/li&gt;
&lt;li&gt;Broadcasting Functions&lt;/li&gt;
&lt;li&gt;Useful for linear algebra, Fourier transform and random number capabilities.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;em&gt;Broadcasting means when you do any operation on the array that operation will get implemented on each element of the array&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Difference between numpy array and list&lt;/strong&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Data type in NumPy should be same whereas it can be different in lists&lt;/li&gt;
&lt;li&gt;Faster to read less bytes of memory&lt;/li&gt;
&lt;li&gt;Contiguous memory&lt;/li&gt;
&lt;li&gt;No type checking when iterating through objects.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;em&gt;List is built in int-type so it consists of four different things it consists Size, Refernce count, Object Type, Object Value.&lt;/em&gt;&lt;br&gt;
This makes NumPy faster than Lists.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Importing and Creating NumPy array&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;&lt;a href="https://res.cloudinary.com/practicaldev/image/fetch/s--yEuft8dq--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://dev-to-uploads.s3.amazonaws.com/uploads/articles/ee2n4k4ave6lbeahqqq2.png" class="article-body-image-wrapper"&gt;&lt;img src="https://res.cloudinary.com/practicaldev/image/fetch/s--yEuft8dq--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://dev-to-uploads.s3.amazonaws.com/uploads/articles/ee2n4k4ave6lbeahqqq2.png" alt="Image description" width="332" height="690"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;a href="https://res.cloudinary.com/practicaldev/image/fetch/s--2PFs4GT---/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://dev-to-uploads.s3.amazonaws.com/uploads/articles/48g41le5kspgtw6c0apb.png" class="article-body-image-wrapper"&gt;&lt;img src="https://res.cloudinary.com/practicaldev/image/fetch/s--2PFs4GT---/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://dev-to-uploads.s3.amazonaws.com/uploads/articles/48g41le5kspgtw6c0apb.png" alt="Image description" width="415" height="240"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Broadcasting and Creating matrix&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;&lt;a href="https://res.cloudinary.com/practicaldev/image/fetch/s--rCvP-TeR--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://dev-to-uploads.s3.amazonaws.com/uploads/articles/d1pbmng42klbpddnct76.png" class="article-body-image-wrapper"&gt;&lt;img src="https://res.cloudinary.com/practicaldev/image/fetch/s--rCvP-TeR--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://dev-to-uploads.s3.amazonaws.com/uploads/articles/d1pbmng42klbpddnct76.png" alt="Image description" width="385" height="447"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Creating Random arrays&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;&lt;a href="https://res.cloudinary.com/practicaldev/image/fetch/s--QSLeuSSs--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://dev-to-uploads.s3.amazonaws.com/uploads/articles/j5hq17k45cuezejb6gfa.png" class="article-body-image-wrapper"&gt;&lt;img src="https://res.cloudinary.com/practicaldev/image/fetch/s--QSLeuSSs--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://dev-to-uploads.s3.amazonaws.com/uploads/articles/j5hq17k45cuezejb6gfa.png" alt="Image description" width="558" height="722"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;a href="https://res.cloudinary.com/practicaldev/image/fetch/s--tR9cgUek--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://dev-to-uploads.s3.amazonaws.com/uploads/articles/ylt1ngptmlqhgw35zl2p.png" class="article-body-image-wrapper"&gt;&lt;img src="https://res.cloudinary.com/practicaldev/image/fetch/s--tR9cgUek--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://dev-to-uploads.s3.amazonaws.com/uploads/articles/ylt1ngptmlqhgw35zl2p.png" alt="Image description" width="387" height="686"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Array Concatenation&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;&lt;a href="https://res.cloudinary.com/practicaldev/image/fetch/s--O-AK6Wo1--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://dev-to-uploads.s3.amazonaws.com/uploads/articles/gmzmvb2tqnaa5i32c0jd.png" class="article-body-image-wrapper"&gt;&lt;img src="https://res.cloudinary.com/practicaldev/image/fetch/s--O-AK6Wo1--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://dev-to-uploads.s3.amazonaws.com/uploads/articles/gmzmvb2tqnaa5i32c0jd.png" alt="Image description" width="531" height="610"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;a href="https://res.cloudinary.com/practicaldev/image/fetch/s--q-dpbqFa--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://dev-to-uploads.s3.amazonaws.com/uploads/articles/5x4hoj2m45106w3c23rv.png" class="article-body-image-wrapper"&gt;&lt;img src="https://res.cloudinary.com/practicaldev/image/fetch/s--q-dpbqFa--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://dev-to-uploads.s3.amazonaws.com/uploads/articles/5x4hoj2m45106w3c23rv.png" alt="Image description" width="540" height="172"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Accessing/ Changing specific elements, rows, columns&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;&lt;a href="https://res.cloudinary.com/practicaldev/image/fetch/s--aIf_RTi6--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://dev-to-uploads.s3.amazonaws.com/uploads/articles/kpi2un86s3behe8qhgpk.png" class="article-body-image-wrapper"&gt;&lt;img src="https://res.cloudinary.com/practicaldev/image/fetch/s--aIf_RTi6--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://dev-to-uploads.s3.amazonaws.com/uploads/articles/kpi2un86s3behe8qhgpk.png" alt="Image description" width="602" height="647"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;a href="https://res.cloudinary.com/practicaldev/image/fetch/s--VTMB6rm0--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://dev-to-uploads.s3.amazonaws.com/uploads/articles/2osssg6olkpp9gbj736g.png" class="article-body-image-wrapper"&gt;&lt;img src="https://res.cloudinary.com/practicaldev/image/fetch/s--VTMB6rm0--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://dev-to-uploads.s3.amazonaws.com/uploads/articles/2osssg6olkpp9gbj736g.png" alt="Image description" width="495" height="472"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;h2&gt;
  
  
  Conclusion
&lt;/h2&gt;

&lt;p&gt;During my recent learning experience, I acquired knowledge on statistics, array manipulation such as reorganizing and stacking, as well as loading data from files. Additionally, I attended a Women Coders Meetup where I gained valuable insights, expanded my knowledge, and networked with individuals who share my interests. As a result, I took a break from my studies to attend this event.&lt;br&gt;
I will continue listing my daily progress and try to remain consistent. Please do share your feedback on how I can my 100daysofcode challenge more productive. I'll see you tomorrow for my daily update.&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Follow me on Twitter: @purvamasurkar12&lt;/li&gt;
&lt;li&gt;Follow me on LinkedIn: @purvamasurkar12&lt;/li&gt;
&lt;/ul&gt;

</description>
      <category>machinelearning</category>
      <category>100daysofcode</category>
      <category>womenintech</category>
      <category>codenewbie</category>
    </item>
    <item>
      <title>Day 2 of 100 Days of ML Code: Python Basics</title>
      <dc:creator>Purva Masurkar</dc:creator>
      <pubDate>Thu, 09 Mar 2023 09:03:05 +0000</pubDate>
      <link>https://forem.com/purvanova213/day-2-of-100-days-of-ml-code-python-basics-24o3</link>
      <guid>https://forem.com/purvanova213/day-2-of-100-days-of-ml-code-python-basics-24o3</guid>
      <description>&lt;blockquote&gt;
&lt;p&gt;"Python is an experiment in how much freedom programmers need. Too much freedom and nobody can read another's code; too little and expressiveness is endangered." - Guido van Rossum, creator of Python&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;On the second day of my #100daysofcode challenge, I have successfully completed the fundamental concepts of Python, such as conditional statements, loops, data structures, string manipulation, functions, modules, packages, and text file handling. I reviewed these concepts from &lt;a href="https://courses.analyticsvidhya.com/courses/Machine-Learning-Certification-Course-for-Beginners"&gt;ML course for beginners&lt;/a&gt;. I am now excited to explore the Python Libraries for Data science.&lt;/p&gt;

&lt;p&gt;I learned a lot of concepts today, but it was too many to list here. I will continue listing my daily progress starting from tomorrow. I'll see you tomorrow for my daily update.&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Follow me on Twitter: &lt;a href="https://twitter.com/purvamasurkar12"&gt;@purvamasurkar12&lt;/a&gt;
&lt;/li&gt;
&lt;li&gt;Follow me on LinkedIn: &lt;a href="https://www.linkedin.com/in/purva-masurkar-6585291bb/"&gt;@purvamasurkar12&lt;/a&gt;
&lt;/li&gt;
&lt;/ul&gt;

</description>
      <category>machinelearning</category>
      <category>100daysofcode</category>
      <category>womenintech</category>
      <category>codenewbie</category>
    </item>
    <item>
      <title>My 100 Days of Machine Learning: A Journey Towards Consistent Programming</title>
      <dc:creator>Purva Masurkar</dc:creator>
      <pubDate>Wed, 08 Mar 2023 04:24:12 +0000</pubDate>
      <link>https://forem.com/purvanova213/my-100-days-of-machine-learning-a-journey-towards-consistent-programming-1lk</link>
      <guid>https://forem.com/purvanova213/my-100-days-of-machine-learning-a-journey-towards-consistent-programming-1lk</guid>
      <description>&lt;p&gt;&lt;strong&gt;Introduction&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Hi everyone! I'm Purva Masurkar, currently a third-year student pursuing BE in Information Technology. Recently, I've made a decision to publicly commit to 100 days of learning Machine Learning and I'm excited to document my journey here on this blog.&lt;/p&gt;

&lt;p&gt;As a student, I've struggled to consistently build my programming skills, often finding myself starting from the basics repeatedly and eventually giving up mid-way. Through public commitment and sharing my progress, I hope to build the habit of coding daily and improve my skills in Python, Machine Learning, and Deep Learning.&lt;/p&gt;

&lt;p&gt;Additionally, this commitment to public sharing will allow me to improve my English through writing and interacting with like-minded individuals. By documenting my learning journey, I aim to not only track my progress but also convince others to embark on their own journey towards consistent programming and personal growth. &lt;/p&gt;

&lt;p&gt;&lt;strong&gt;## Day 1&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;As I kick off my 100 Days of Machine Learning journey, I decided to start off by taking things slow and steady. Day 1 involved getting acquainted with some basic terminologies of Machine Learning. I didn't want to dive too deep into the complexity right away, so I'm taking my time to build a strong foundation. After all, Rome wasn't built in a day, and neither will my ML expertise!&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Machine Learning&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;ML is a modern software development technique that enables computers to solve problems by using examples of real world data. It's a subset of Artificial Intelligence that includes three major categories:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Supervised Learning : In this every data has a corresponding label.&lt;/li&gt;
&lt;li&gt;Unsupervised Learning: There are no labels for the training data.&lt;/li&gt;
&lt;li&gt;Reinforcement learning: In this it tells us which action to take in a situation to maximize the reward.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;Components of ML&lt;/strong&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;ML model :It is an extremely generic program or block of code that can be modified to solve different but related problems.&lt;/li&gt;
&lt;li&gt;Model training algorithm: The current model iteration is analyzed to determine what changes can be made to get closer to the goal.&lt;/li&gt;
&lt;li&gt;Model Inference: Here the trained model is used to generate predictions.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;Steps in ML process&lt;/strong&gt;&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;Define the problem&lt;/li&gt;
&lt;li&gt;Build the Dataset&lt;/li&gt;
&lt;li&gt;Train the model&lt;/li&gt;
&lt;li&gt;Evaluate the model&lt;/li&gt;
&lt;li&gt;Use the model&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;We first collect the data and inspect it for outliers, missing values, data reformatting, and data visualisations. We also randomly split the data, which allows us to keep data hidden during training and use it to test the model before releasing it into production.&lt;/p&gt;

&lt;p&gt;The model training algorithm iteratively updates a model's parameters to minimize some loss function.&lt;/p&gt;

&lt;p&gt;The &lt;em&gt;loss function&lt;/em&gt; is a measure of how well the model is performing on a given task. It measures the difference between the predicted output and the true output for a given set of input data. &lt;br&gt;
Whereas &lt;em&gt;Model's parameter&lt;/em&gt; are setting or configuration that the training algorithm can update to change how model behaves.&lt;/p&gt;

&lt;p&gt;&lt;em&gt;Hyperparameters&lt;/em&gt; are setting that are not changes during training but can affect how quickly or how reliably the model trains such as number of clusters the model should identify.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Closing Remarks&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;I understand that all of the above terms are basic, but I want to brush up on my previous knowledge for the first two days before diving deeper into it. Please leave comments on how I may better my learnings and blogs. I will take all of your suggestions into consideration. &lt;/p&gt;

</description>
      <category>machinelearning</category>
      <category>100daysofcode</category>
      <category>codenewbie</category>
      <category>womenintech</category>
    </item>
  </channel>
</rss>
