<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>Forem: Carol Ajando</title>
    <description>The latest articles on Forem by Carol Ajando (@carol_ajando).</description>
    <link>https://forem.com/carol_ajando</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F1897517%2F87d9bbd3-46f0-439a-a620-042b79222b2a.png</url>
      <title>Forem: Carol Ajando</title>
      <link>https://forem.com/carol_ajando</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://forem.com/feed/carol_ajando"/>
    <language>en</language>
    <item>
      <title>Understanding your Data:The Essentials of Explanatory Data Analysis</title>
      <dc:creator>Carol Ajando</dc:creator>
      <pubDate>Wed, 21 Aug 2024 18:30:19 +0000</pubDate>
      <link>https://forem.com/carol_ajando/understanding-your-datathe-essentials-of-explanatory-data-analysis-39nm</link>
      <guid>https://forem.com/carol_ajando/understanding-your-datathe-essentials-of-explanatory-data-analysis-39nm</guid>
      <description>&lt;p&gt;Originally developed in the 1970s by John Turkey, Explanatory Data Analysis (EDA), continues to be widely used in data science till date.&lt;br&gt;
Explanatory Data Analysis (EDA), is used by data scientists to get a better look and understanding of data sets by utilizing data visualization methods.&lt;br&gt;
EDA allows data scientists to analyze, investigate and identify main characteristics of data sets.&lt;br&gt;
EDA is used to see what data can reveal beyond the formal modeling or hypothesis testing hence providing a better understanding of data set variables and the relationships between them.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;##  Why is EDA in data science important?&lt;/strong&gt;&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;The main purpose of EDA is to get a better understanding of data before making any assumptions.&lt;/li&gt;
&lt;li&gt;EDA facilitates the process of data cleaning where it makes it easy to identify outliers, duplicates and any errors within the data set.&lt;/li&gt;
&lt;li&gt;EDA enables data scientists to produce effective, error-free &amp;amp; Valid results that can be used for decision-making.&lt;/li&gt;
&lt;li&gt;EDA is used to identify patterns and features within the data set such as categorical variables, mean, mode, standard deviation and confidence intervals.&lt;/li&gt;
&lt;li&gt;After completion of EDA and insights drawn the features identified can be used for more sophisticated data analysis or modelling such as machine learning.&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;&lt;strong&gt;## Tpes of EDA&lt;/strong&gt;&lt;br&gt;
EDA can be categorized into Univariate, Bivariate and multivariate.&lt;br&gt;
The categories can be identified further into graphical and non-graphical&lt;br&gt;
Non-graphical methods are used mostly for statistical deductions.&lt;br&gt;
Graphical methods are used to get a full picture of the data sets.&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;Univariate: This is the most simple form of data analysis which  is used where the data sets consists of one variable/column. This includes; boxplots, histogram.&lt;/li&gt;
&lt;li&gt;Bivariate: Is used to decribe data and find patterns that exist within data sets that contain two variables/columns. The most commonly used graphical presentation include; scatter plots, bar plots.&lt;/li&gt;
&lt;li&gt;Multivariate: Is used for EDA of data sets that contain multiple variables/columns. This included; scatter plots, heatmaps,Runchart, bubble charts.&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;Python and R programming languages are the mostly used tools that are used to perform EDA.&lt;br&gt;
Common libraries used in python for EDA: Numpy, Pandas, Matplotlib, seaborn, and Ploty.&lt;br&gt;
There is no single method in performing EDA, the decision varies depending on the data set that you have.&lt;/p&gt;

</description>
      <category>beginners</category>
      <category>career</category>
      <category>datascience</category>
      <category>devchallenge</category>
    </item>
    <item>
      <title>Expert advice on how to build a sucessful career in data science</title>
      <dc:creator>Carol Ajando</dc:creator>
      <pubDate>Wed, 07 Aug 2024 19:52:08 +0000</pubDate>
      <link>https://forem.com/carol_ajando/expert-advice-on-how-to-build-a-sucessful-career-in-data-science-3h2p</link>
      <guid>https://forem.com/carol_ajando/expert-advice-on-how-to-build-a-sucessful-career-in-data-science-3h2p</guid>
      <description>&lt;p&gt;In today's data-driven world,Data science has become one of the fastest rising and most sought-after field.Data science involves extraction,analysis and interpretation of data to develop valuable insight and inform strategic decisions.Diving into the data science space requires a lot of hard work and dedication.&lt;br&gt;
Here are some of the tips and pre-requisites you need to build a sucessful career in data science.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;1.Education&lt;/strong&gt;&lt;br&gt;
Data science involves a lot of calculations therefore Math is key.&lt;br&gt;
A career in data science is built on a strong mathematical foundation that includes; Linear algebra, matrix theory,calculation,statistics and probability.&lt;br&gt;
While many data scientists are self-taught,strong IT background in fields such as Computer Science, Statistics or Engineering may be required but not necessary.&lt;br&gt;
Many people are making use of online resources which are mostly free such as courses offered on google, IBM,freecodecamp.org,W3 schools etc.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;2.Skills&lt;/strong&gt;&lt;br&gt;
A combination of technical,analytical and soft skills are required in the field of data science.&lt;br&gt;
These skills include;&lt;br&gt;
&lt;strong&gt;1.Data visualization:&lt;/strong&gt; Transforming data and findings into understandable and visually appealing formats using tools like Tableau,PowerBi and libraries in python such as (Matplotlib &amp;amp;seaborn)&lt;br&gt;
&lt;strong&gt;2.Programming&lt;/strong&gt;:&lt;br&gt;
This is a non-negotiable skill in data science.One should be proficient in Python and R languages which are esswntial for data manupulation,statistical analysis and machine learning.&lt;br&gt;
&lt;strong&gt;3.Machine Learning and AI:&lt;/strong&gt;&lt;br&gt;
This includes understanding and implementing machine learning such as Scikit-learn,Tensor flow and keras.&lt;br&gt;
&lt;strong&gt;4.Data Wrangling:&lt;/strong&gt;&lt;br&gt;
This is the ability to handle missing values,outliers,merging datasets indo desired formats for analysis.&lt;br&gt;
Some of the soft skills include:Good communication,Business Acumen and  curiosity and open to learning.&lt;br&gt;
These are just but a few of the skills required in the field of data science.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;3.Job Searching&lt;/strong&gt;&lt;br&gt;
This is the secind last steps in building a career in data science and probably one of the steps that would take a longer time depending on luck, network and many more.&lt;br&gt;
&lt;strong&gt;Networking&lt;/strong&gt; can significantly improve your job search and imorove your chances to land a job.This includes connecting with professionals and data scientists as yourself.Tools such as Linkedin have proven to be very sucessful as a networking tool to land jobs and connect.&lt;br&gt;
&lt;strong&gt;Customize CV and cover letter&lt;/strong&gt; according to each application and highlighting relevant skills while also using key-words.&lt;br&gt;
Lastly but most importantly &lt;strong&gt;create and document your projects.&lt;/strong&gt;This helps to highlight your skills to any potential emoloyer.Add a link or attach samples in any Job application that you do.&lt;/p&gt;

&lt;p&gt;The future is built on data and the demand for data scientists keeps growing each day.The field is ever-evolving so, keep learning to keep-up with emerging technologies to prevent being obsolete in this field.&lt;/p&gt;

&lt;p&gt;Data science is an exiting field ,brace yourself for a fullfilling Journey ahead!&lt;/p&gt;

</description>
      <category>beginners</category>
      <category>programming</category>
      <category>python</category>
      <category>career</category>
    </item>
  </channel>
</rss>
