<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>Forem: maggy njuguna</title>
    <description>The latest articles on Forem by maggy njuguna (@maggy_njuguna).</description>
    <link>https://forem.com/maggy_njuguna</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F3708635%2Fc34b9345-9109-4ab8-ab52-d4ddef96bdc5.png</url>
      <title>Forem: maggy njuguna</title>
      <link>https://forem.com/maggy_njuguna</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://forem.com/feed/maggy_njuguna"/>
    <language>en</language>
    <item>
      <title>Introduction to python for data analytics.</title>
      <dc:creator>maggy njuguna</dc:creator>
      <pubDate>Sun, 15 Mar 2026 13:49:12 +0000</pubDate>
      <link>https://forem.com/maggy_njuguna/introduction-to-python-for-data-analytics-130h</link>
      <guid>https://forem.com/maggy_njuguna/introduction-to-python-for-data-analytics-130h</guid>
      <description>&lt;h1&gt;
  
  
  Introduction
&lt;/h1&gt;

&lt;p&gt;Python is a programming language created by Guido van Rossum and released in 1991.The most recent version of python is python 3 which is widely used.Python was invented to simplify programming by enabling writing of simple and clear code.&lt;br&gt;
Programming sounds complex but its not ,in simple terms it is the process of giving instructions to a computer on how to perform specif tasks.Python is one of the languages used to give directions to computers on how to perform tasks.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Uses of Python.&lt;/strong&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Python can be used on a server to create web applications -this involves designing, creating and testing software programs that can be accessed over the internet using a browser.&lt;/li&gt;
&lt;li&gt;Data analysis -Python is used in data analysis to perform analysis on large data sets and create visualizations.&lt;/li&gt;
&lt;li&gt;Software development -Python is used to create programs that allow users to perform tasks on a mobile phone or a computer.It has been used to create tools that automate tasks example sending automated emails or analysis of large data sets.&lt;/li&gt;
&lt;li&gt;Automation -Ever gotten tired of doing the same task over and over again?, Python comes in handy and helps automate repetitive tasks example collecting customer information from a company's website. &lt;/li&gt;
&lt;li&gt;Artificial intelligence and machine learning.
Machine learning in simple words is teaching a computer how to perform tasks.Example teaching a computer to identify many balls and how it can be able to identify a blue ball from many balls.
Artificial intelligence aims at giving computers human intelligence that is enabling them to think and make decisions like human beings.Imagine that !&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;Why Python is preferred over other programming languages.&lt;/strong&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Simple beginner friendly syntax -syntax is simply the set of rules that enable a computer to perform tasks .Python syntax is simple like the English language which makes it easy to learn.&lt;/li&gt;
&lt;li&gt;Python can be used in different platforms -It runs across many operating systems without requiring major modifications.&lt;/li&gt;
&lt;li&gt;Flexibility- Python can be used in many fields including automation, data analysis ,web development and artificial intelligence.&lt;/li&gt;
&lt;li&gt;Global community of developers -Python has good community support which ensures availability of learning resources and quick responses to solving problems.&lt;/li&gt;
&lt;li&gt;Python allows for different programming styles.

&lt;ul&gt;
&lt;li&gt;Procedural -the program to be followed by a computer  is written step by step .&lt;/li&gt;
&lt;li&gt;Object-oriented -this programming style instructions to be followed by a computer around real life objects.&lt;/li&gt;
&lt;li&gt;Functional programming - it uses functions to perform calculations and analysis on data.&lt;/li&gt;
&lt;/ul&gt;


&lt;/li&gt;

&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;Python installation.&lt;/strong&gt;&lt;br&gt;
Many systems have python already installed however if your computer does not have python installed it can be downloaded from the official website. &lt;br&gt;
To check on your python version run this code in your command line(cmd)&lt;br&gt;
&lt;code&gt;c:/users/ --python version&lt;/code&gt;&lt;/p&gt;

&lt;p&gt;&lt;em&gt;Python can run in various IDE(Integrated Development Environment) including; Sublime ,Visual studio code,PyCharm and jupyter notebook.&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Python syntax&lt;/strong&gt;&lt;br&gt;
Python is executed using simple syntax.&lt;br&gt;
&lt;code&gt;print("Hello world")&lt;/code&gt;&lt;br&gt;
&lt;strong&gt;Python indentation&lt;/strong&gt;&lt;br&gt;
Indentation in python is used to indicate a block of code.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F2jhgww63byoedr7poybh.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F2jhgww63byoedr7poybh.png" alt=" " width="800" height="280"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Comments in python&lt;/strong&gt;&lt;br&gt;
Single line comments begin with the harsh symbol.&lt;br&gt;
&lt;code&gt;# perform analysis&lt;/code&gt;&lt;br&gt;
Using docstrings for multiple lines of comments .&lt;code&gt;""" or '''&lt;/code&gt;&lt;br&gt;
&lt;strong&gt;Identifiers in python&lt;/strong&gt;&lt;br&gt;
An identifier is a name used to identify a variable or function.&lt;br&gt;
&lt;code&gt;name = "Angel"&lt;/code&gt;&lt;br&gt;
&lt;code&gt;score = 90&lt;/code&gt;&lt;br&gt;
Name and score are identifiers used to name variables.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Variables in python&lt;/strong&gt;&lt;br&gt;
A variable in python is a name used to store a value in a program and will later be used in the program.&lt;br&gt;
&lt;code&gt;Name = "Maggy"&lt;/code&gt;&lt;br&gt;
&lt;code&gt;Town = "Nakuru"&lt;/code&gt;&lt;br&gt;
&lt;code&gt;Age = 15&lt;/code&gt;&lt;br&gt;
Name, age, and town are the variables for storing Maggy, town and age. &lt;br&gt;
 &lt;em&gt;You don't have to specify the data type in python unlike other programming languages.Python understands the data type based on the assigned value.&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Differences between a variable and an identifier&lt;/strong&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;An identifier is a name used to identify elements in a program while a variable is a name used to store a value inside a program.&lt;/li&gt;
&lt;li&gt;An identifier is used to label variables and functions while a variable is used to store values that can be used later in a program.&lt;/li&gt;
&lt;li&gt;An identifier does not hold data while a variable hold data.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Examples of identifiers &lt;code&gt;name,age,city,score,calculate_sum&lt;/code&gt;&lt;/p&gt;

&lt;p&gt;Examples of variables &lt;code&gt;name = "Maggy"&lt;/code&gt; ,&lt;code&gt;city = "kisumu"&lt;/code&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Rules for naming variables.&lt;/strong&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;A variable name should start with a letter or an undescore.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;code&gt;City = "Mombasa"&lt;/code&gt;&lt;/p&gt;

&lt;p&gt;&lt;code&gt;-school = "Daystar"&lt;/code&gt;&lt;br&gt;
&lt;code&gt;_number = 23&lt;/code&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;A variable cannot start with a number.&lt;/li&gt;
&lt;li&gt;Variable names are case sensitive.&lt;/li&gt;
&lt;li&gt;You cannot name a variable using python keywords.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;The = (assignment operator) is used to assign value in python.&lt;br&gt;
Python allows the use of multiple variables in a single line.&lt;/p&gt;

&lt;p&gt;&lt;code&gt;y, x, z = 2,3,5&lt;/code&gt;&lt;br&gt;
&lt;code&gt;print(y)&lt;/code&gt;&lt;br&gt;
&lt;code&gt;print(x)&lt;/code&gt;&lt;br&gt;
&lt;code&gt;print(z)&lt;/code&gt;&lt;/p&gt;

&lt;p&gt;&lt;em&gt;In python uppercase letters are used on a variable that should not change.&lt;/em&gt;&lt;br&gt;
&lt;code&gt;PI = 3.142&lt;/code&gt;&lt;/p&gt;

&lt;p&gt;The type() function is used to check the type of your variable .&lt;br&gt;
&lt;code&gt;y = 10&lt;/code&gt;&lt;br&gt;
&lt;code&gt;print(type(x))&lt;/code&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Data types in python&lt;/strong&gt;&lt;br&gt;
Data type is the classification of data that enables a computer identify the data its working with and the operations that be performed on that data.&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Numeric types&lt;/strong&gt;

&lt;ul&gt;
&lt;li&gt;int -these are whole numbers .&lt;/li&gt;
&lt;/ul&gt;


&lt;/li&gt;

&lt;/ul&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fncpf67yf74ey9yjsnu5a.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fncpf67yf74ey9yjsnu5a.png" alt=" " width="509" height="222"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Floats- these are numbers with decimal points. &lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fui4d69gtuk0yg1yp12ut.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fui4d69gtuk0yg1yp12ut.png" alt=" " width="653" height="216"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Complex -these are used in scientific computing. &lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F4zjcjkprhjjhcjxld46a.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F4zjcjkprhjjhcjxld46a.png" alt=" " width="529" height="163"&gt;&lt;/a&gt;&lt;br&gt;
Changing data types.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fc6ox4br88am6a4xgb77e.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fc6ox4br88am6a4xgb77e.png" alt=" " width="371" height="255"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Text types&lt;/strong&gt;

&lt;ul&gt;
&lt;li&gt;str - string is a sequence of characters.&lt;/li&gt;
&lt;/ul&gt;


&lt;/li&gt;

&lt;/ul&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F4wzo2q1q9kn6f437gnqv.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F4wzo2q1q9kn6f437gnqv.png" alt=" " width="800" height="345"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Boolean type&lt;/strong&gt;
This data type has two possible values:True or false.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fcoymb89rc4uad8qyt9b0.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fcoymb89rc4uad8qyt9b0.png" alt=" " width="800" height="252"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Sequence types&lt;/strong&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;List -it is ordered ,changable and allows duplicates.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fxbyzoqfns2borp04ubvv.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fxbyzoqfns2borp04ubvv.png" alt=" " width="800" height="185"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Tuple - it is ordered, unchangable and allows duplicates.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F3hkhil1rucpbi20515ka.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F3hkhil1rucpbi20515ka.png" alt=" " width="800" height="312"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Range - represents a sequence of numbers.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F0bhclho971l8cv86w3rl.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F0bhclho971l8cv86w3rl.png" alt=" " width="693" height="252"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Mapping types&lt;/strong&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt; Dictionaries this is a collection of key value pairs and is used to quickly look up values and organize data.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fosm5x6l2sofs5a72ttm3.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fosm5x6l2sofs5a72ttm3.png" alt=" " width="800" height="243"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Set types&lt;/strong&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;set- unordered and does not allow duplicates.&lt;/li&gt;
&lt;li&gt;Frozen -similar to sets but unchangable.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fxdj2jprqqd1legjn4dto.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fxdj2jprqqd1legjn4dto.png" alt=" " width="800" height="197"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Arithmetic operators&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F1r5xi4hwyin8moixom7w.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F1r5xi4hwyin8moixom7w.png" alt=" " width="720" height="885"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Comparison and logical operators in python&lt;/strong&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;== Equals to.&lt;/li&gt;
&lt;li&gt;!= Not equal to .&lt;/li&gt;
&lt;li&gt;&amp;gt;  Greater than.&lt;/li&gt;
&lt;li&gt;&amp;lt;  Less than.&lt;/li&gt;
&lt;li&gt;&amp;gt;= Greater than or equal to.&lt;/li&gt;
&lt;li&gt;&amp;lt;=   Less than or equal to.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fb9alewthosavr791y8hq.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fb9alewthosavr791y8hq.png" alt=" " width="728" height="386"&gt;&lt;/a&gt;&lt;br&gt;
&lt;strong&gt;Logical operators&lt;/strong&gt;&lt;br&gt;
The common logical operators in python include:&lt;em&gt;and, or and not&lt;/em&gt;&lt;br&gt;
and their use case is shown below.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F8m91cm55vm9jl9mpy6si.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F8m91cm55vm9jl9mpy6si.png" alt=" " width="800" height="403"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Conclusion&lt;/strong&gt;&lt;br&gt;
It is important for every data analyst to understand python since it makes data cleaning, calculations and analysis effective.&lt;/p&gt;

</description>
      <category>python</category>
      <category>productivity</category>
      <category>learning</category>
    </item>
    <item>
      <title>Connecting power bi to SQL databases locally and through cloud service.</title>
      <dc:creator>maggy njuguna</dc:creator>
      <pubDate>Fri, 13 Mar 2026 10:38:47 +0000</pubDate>
      <link>https://forem.com/maggy_njuguna/connecting-power-bi-to-sql-databases-locally-and-through-cloud-service-4876</link>
      <guid>https://forem.com/maggy_njuguna/connecting-power-bi-to-sql-databases-locally-and-through-cloud-service-4876</guid>
      <description>&lt;p&gt;Power BI is a  powerful tool designed for cleaning data ,creating reports and dashboards.It is preferred over other tools due to its ability to handle large data sets,create real-time dashboards and its beginner friendly.Power BI desktop is a free version provided by Microsoft.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Main uses of Power BI&lt;/strong&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Connecting data from different sources- Power BI has the ability to connect data from many sources including ;web,excel,SQL.&lt;/li&gt;
&lt;li&gt;Cleaning and transforming data-power bi is used to clean messy data making it ready to use for analysis.&lt;/li&gt;
&lt;li&gt;Data modelling-Through power bi it is possible to combine different tables with data for effective analysis.&lt;/li&gt;
&lt;li&gt;Data analysis using DAX(Data Analysis Expressions)-DAX is used to perform calculations in power bi.&lt;/li&gt;
&lt;li&gt;Data visualisation -Power BI converts data into charts,filters,slicers,graphs and reports.
Reports and dashboard creation.&lt;/li&gt;
&lt;li&gt;Data automation, sharing and collaboration using Power BI 
service.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;Why companies connect Power BI to databases.&lt;/strong&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Power bi's ability to handle large data sets -databases store large sets of data and power bi is very effective for handling large data.&lt;/li&gt;
&lt;li&gt;Real time data -using power bi service companies can their data processed and accessed easily.Any changes made to data reflect instantly.&lt;/li&gt;
&lt;li&gt;Data security -databases ensure data is protected and control its access.&lt;/li&gt;
&lt;li&gt;Data centralisation -companies store their data in central databases and by connecting the database to power bi enables the staff to work with similar data at a given time.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;Importance of SQL databases for storing and managing data&lt;/strong&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;SQL databases have structured data storage - Data is stored in tables which have rows and columns and this makes it easy to retrieve and use the data.&lt;/li&gt;
&lt;li&gt;Data retrieval efficiency -It is easy to retrieve specific data from SQL databases using queries.&lt;/li&gt;
&lt;li&gt;Data integrity -SQL databases have rules and constraints including primary and foreign keys which ensure data remains accurate and consistent.&lt;/li&gt;
&lt;li&gt;Large data sets- Most organisations deal with large volumes of data and SQL databases are effective for storage and management of this data.&lt;/li&gt;
&lt;li&gt;SQL supports data relationship -Through SQL databases it is possible to connect data from different sources.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;Connecting power bi to a local postgresql&lt;/strong&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Open the power bi desktop .&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fasrjgibb4129qtbbu7gw.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fasrjgibb4129qtbbu7gw.png" alt=" " width="800" height="329"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fq9cqbp0q6mta2pelwfrn.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fq9cqbp0q6mta2pelwfrn.png" alt=" " width="800" height="360"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt; Click on &lt;code&gt;Get data&lt;/code&gt;in the home ribbon ,this enables power bi to connect to many data sources.&lt;/li&gt;
&lt;li&gt;&lt;p&gt;Many options are available including excel,csv,web services click on postgresql .&lt;br&gt;
&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fffnp97vkz1hy8j24p9um.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fffnp97vkz1hy8j24p9um.png" alt=" " width="800" height="550"&gt;&lt;/a&gt;&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;Click connect-once you select postgresql click the button to connect.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;Enter the server name &lt;code&gt;local host:5432&lt;/code&gt;-this means that Power bi will be hosted locally in your computer.&lt;/p&gt;&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fesemibnlqesibhub7bse.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fesemibnlqesibhub7bse.png" alt=" " width="800" height="407"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Enter the database name &lt;code&gt;postgres&lt;/code&gt;-this is the database that contains the tables we will be using.&lt;/li&gt;
&lt;li&gt;Enter the authentication detail ;username and password.&lt;/li&gt;
&lt;li&gt;Click connect.&lt;/li&gt;
&lt;li&gt;After connection is complete a navigator window will open which shows what your database contains.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fglkpqx9bf22qbyl74nqd.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fglkpqx9bf22qbyl74nqd.png" alt=" " width="800" height="762"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;p&gt;Select the tables  you want to load from your database (Customers_2,sales,inventory and products).&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;You can load your data directly into power bi.&lt;/li&gt;
&lt;li&gt;You can use power query to clean and transform your data before loading.
&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fp8oc1au3u5lumqobz7oc.png" alt=" " width="800" height="510"&gt;
&lt;/li&gt;
&lt;/ul&gt;


&lt;/li&gt;

&lt;li&gt;&lt;p&gt;Once your tables are loaded they will appear on the data tab as shown below.&lt;br&gt;&lt;br&gt;
&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Ft0a58copjmxcjc0xb51r.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Ft0a58copjmxcjc0xb51r.png" alt=" " width="417" height="427"&gt;&lt;/a&gt;&lt;/p&gt;&lt;/li&gt;

&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;Connecting Power bi to Aiven PostgreSQL&lt;/strong&gt;&lt;br&gt;
The process of connecting power bi to a cloud database involves the following steps.Each step is illustrated with an image for clear understanding .&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Log into your aiven account and open your Aiven PostgreSQL to get your connection details.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fhhv5ls888qo3vn1ijqh8.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fhhv5ls888qo3vn1ijqh8.png" alt=" " width="800" height="394"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;You will find the following details:Host ,Database name,Port,username and password.&lt;/li&gt;
&lt;li&gt;Go the connection information and find the SSL certificate section.
&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fubk9okj769ql7j4h52cg.png" alt=" " width="800" height="120"&gt;
&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fs7loj4fwidx9hxu6zdtd.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fs7loj4fwidx9hxu6zdtd.png" alt=" " width="800" height="351"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Download the SSL certificate and save it in your computer.&lt;/li&gt;
&lt;li&gt;Open power bi desktop and once you are in the &lt;code&gt;home&lt;/code&gt; tab click &lt;code&gt;Get data&lt;/code&gt; then select &lt;code&gt;PostgreSQL database&lt;/code&gt;
&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F093wz8tcepdwyremxobq.png" alt=" " width="800" height="516"&gt;
&lt;/li&gt;
&lt;li&gt;Enter the connection details provided by Aiven and click OK.
&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fqbjkqc5cq1edhmg93217.png" alt=" " width="800" height="386"&gt;
&lt;/li&gt;
&lt;li&gt;Configure the SSL details -enter the certificate downloaded in your computer.&lt;/li&gt;
&lt;li&gt;The navigator window appears and click load or transform data.
&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fugcookekdxkwv1f0wcv2.png" alt=" " width="800" height="673"&gt;
&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Ft8g84dozlm2txvqq257m.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Ft8g84dozlm2txvqq257m.png" alt=" " width="387" height="471"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;em&gt;The SSL licence provides secure connection between Power bi and the cloud service(Aiven).&lt;/em&gt;&lt;br&gt;
&lt;em&gt;It protects the data from any malicious acts.&lt;/em&gt;&lt;br&gt;
&lt;em&gt;It also ensures you are connected to the right server and your credentials are protected.&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Loading data into power bi&lt;/strong&gt;&lt;br&gt;
 Open power bi desktop and once you are in the &lt;code&gt;home&lt;/code&gt; tab click &lt;code&gt;Get data&lt;/code&gt; then select &lt;code&gt;PostgreSQL database&lt;/code&gt;&lt;br&gt;
 Select the tables  you want to load from your database (Customers_2,sales,inventory and products).&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;You can load your data directly into power bi.&lt;/li&gt;
&lt;li&gt;You can use power query to clean and transform your data.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Ffw1we7y0acd7g8yfvdk9.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Ffw1we7y0acd7g8yfvdk9.png" alt=" " width="800" height="305"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Data Modelling and relationships&lt;/strong&gt;&lt;br&gt;
This process involves structuring,organizing and connecting data tables for effective performance.Designing a good data model is important since it makes reports accurate and easy to understand.To set up a data model in Power BI is done by opening the report view and go to Data Model.It involves bringing many data tables together and connecting the relationship between them.&lt;br&gt;
&lt;em&gt;A schema is a structure that shows how data is organized and related in a data model&lt;/em&gt;&lt;/p&gt;

&lt;h2&gt;
  
  
  Star Schema
&lt;/h2&gt;

&lt;p&gt;This data model contains one central fact table and many dimension tables.The dimension tables are connected directly to the Facts table.&lt;br&gt;
The fact table is at the center surrounded by the dimension tables forming the star shape.&lt;/p&gt;

&lt;h2&gt;
  
  
  Relationships in Data models
&lt;/h2&gt;

&lt;p&gt;This explains how tables are connected in a data model.&lt;/p&gt;

&lt;h2&gt;
  
  
  Types of relationships
&lt;/h2&gt;

&lt;ul&gt;
&lt;li&gt;1 to many(1:*)-One record in a dimension data relates to many records in a fact table.&lt;/li&gt;
&lt;li&gt;Many to 1(*:1)-Same as 1 to many but viewed in the opposite direction.&lt;/li&gt;
&lt;li&gt;Many to many(:)-multiple records in one table match multiple records in another table.&lt;/li&gt;
&lt;/ul&gt;

&lt;h2&gt;
  
  
  Facts table
&lt;/h2&gt;

&lt;p&gt;This table stores the key values for your analysis.It stores measurable and quantitative values and the values are mostly numeric.&lt;br&gt;
&lt;em&gt;In our case ,the sales table is the fact table.&lt;/em&gt;&lt;/p&gt;

&lt;h2&gt;
  
  
  Dimension tables.
&lt;/h2&gt;

&lt;p&gt;This tables act like a look up table for the values in the facts table.&lt;/p&gt;

&lt;p&gt;&lt;em&gt;In our case the inventory,products and customers table are the dimensional tables.&lt;/em&gt;&lt;/p&gt;

&lt;h2&gt;
  
  
  Why is good data modelling critical?
&lt;/h2&gt;

&lt;ul&gt;
&lt;li&gt;Good data modelling ensures data accuracy by reducing data ambiguity and duplication.&lt;/li&gt;
&lt;li&gt;A good data model improves performance by reducing the number of joins and relationships.Like in the case of star schema which has one fact table and a few dimension tables.&lt;/li&gt;
&lt;li&gt;Simplified analysis -a well designed data model makes it easy to perform accurate DAX calculations.&lt;/li&gt;
&lt;li&gt;A good data model ensures data integrity by enhancing accurate storage of data hence maintaining its integrity over time.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;CONCLUSION.&lt;/strong&gt;&lt;br&gt;
It is important for power bi analysts to have SQL skills since they enable them to retrieve the stored data,interpret it and come up with reports that will impact decision making.Most organizations store their data in SQL databases therefore the power bi analysts have to retrieve the data from the databases for analysis.&lt;br&gt;
Using the &lt;code&gt;WHERE&lt;/code&gt; clause analysts are able to work on specific data.&lt;br&gt;
SQL performs aggregation using functions such as &lt;code&gt;COUNT&lt;/code&gt;,&lt;code&gt;SUM&lt;/code&gt;,&lt;code&gt;AVERAGE&lt;/code&gt; which enable analysts to summarise large sets of data and draw conclusions.&lt;br&gt;
Analysts clean and transform data using query before building visualisations to feature in the dashboard.&lt;br&gt;
They also use joins,subqueries and common table expressions to combine data from multiple tables for analysis.&lt;/p&gt;

</description>
      <category>sql</category>
      <category>database</category>
      <category>productivity</category>
      <category>beginners</category>
    </item>
    <item>
      <title>ADVANCED SQL FOR DATA ANALYTICS:ADVANCED TECHNIQUES EVERY DATA ANALYST SHOULD KNOW.</title>
      <dc:creator>maggy njuguna</dc:creator>
      <pubDate>Mon, 09 Mar 2026 16:24:13 +0000</pubDate>
      <link>https://forem.com/maggy_njuguna/advanced-sql-for-data-analyticsadvanced-techniques-every-data-analyst-should-know-431d</link>
      <guid>https://forem.com/maggy_njuguna/advanced-sql-for-data-analyticsadvanced-techniques-every-data-analyst-should-know-431d</guid>
      <description>&lt;p&gt;&lt;strong&gt;Introduction&lt;/strong&gt;&lt;br&gt;
SQL(Structured Query Language) is an important tool in data analysis and is used to retrieve data ,manipulate and analyse stored data.Basic SQL techniques are important in data analysis however advanced techniques enhance analysis of complex data sets.The advanced techniques in SQL include;&lt;code&gt;window functions,subqueries,ctes,stored procedures and indexes&lt;/code&gt;.&lt;/p&gt;

&lt;h2&gt;
  
  
  REAL WORLD PROBLEMS IN DATA
&lt;/h2&gt;

&lt;p&gt;These are the products encountered by data analysts while trying to retrieve, manage and make decisions using data.Tools like SQL come in handy to solve these problems. Main data problems include;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Data quality and cleaning -data analysis involves using large data sets which are often messy,inconsistent and missing values and require cleaning and standardisation.Poor quality data can give inconsistent and unreliable results.&lt;/li&gt;
&lt;li&gt;Data integration -Most data is stored in different tables and requires merging which if not done correctly may lead to inconsistencies.&lt;/li&gt;
&lt;li&gt;Large volumes of data -organizations deal with large data sets which sometimes slow down query execution.&lt;/li&gt;
&lt;li&gt;Data security - organizations deal with sensitive information which if not handled properly may lead to data violation and legal issues.&lt;/li&gt;
&lt;/ul&gt;

&lt;h2&gt;
  
  
  WINDOW FUNCTIONS IN SQL
&lt;/h2&gt;

&lt;p&gt;Window functions perform calculations on related rows and keeps each row in the result unlike &lt;em&gt;GROUP BY&lt;/em&gt; which groups the rows together.&lt;br&gt;
A window function allows viewing of a group of rows.&lt;br&gt;
The &lt;code&gt;OVER()&lt;/code&gt; Clause is used in window functions.&lt;br&gt;
Examples of window functions &lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;&lt;code&gt;ROW_NUMBER()&lt;/code&gt;&lt;/li&gt;
&lt;li&gt;&lt;code&gt;RANK()&lt;/code&gt;&lt;/li&gt;
&lt;li&gt;&lt;code&gt;DENSE_RANK&lt;/code&gt;&lt;/li&gt;
&lt;li&gt;&lt;code&gt;LEAD AND LAG&lt;/code&gt;&lt;/li&gt;
&lt;li&gt;&lt;code&gt;NTILE&lt;/code&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;code&gt;RUNNING TOTAL&lt;/code&gt;&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;ROW_NUMBER.&lt;/strong&gt;&lt;br&gt;
It assigns a unique number to each row based on a condition.&lt;br&gt;
In the case of 2 rows having similar values, this window function issues different numbers to the given rows.&lt;/p&gt;&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fzpfd35x0o56mz6gqcref.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fzpfd35x0o56mz6gqcref.png" alt=" " width="800" height="343"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;RANK.&lt;/strong&gt;
This window function issues a ranking number to each row based on the given conditions.
If two rows share the same value, they are assigned with the same value and the next number is skipped.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F3o5f3mkswfx6cmj3dj6j.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F3o5f3mkswfx6cmj3dj6j.png" alt=" " width="800" height="421"&gt;&lt;/a&gt;&lt;br&gt;
 &lt;em&gt;From this query rows with ties are issued with the same numbers and the next value is skipped.&lt;/em&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;DENSE_RANK&lt;/strong&gt;
Dense_rank is almost similar to rank and is used to issue ranks to rows but in relation to ties dense-rank does not leave gaps.The next number is issued after assigning a number to ties.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;Example&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F6zlxu9fl2ses2hr1qoxv.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F6zlxu9fl2ses2hr1qoxv.png" alt=" " width="800" height="471"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;em&gt;Dense_rank does not skip the next number after handling ties.&lt;/em&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;LEAD AND LAG&lt;/strong&gt;&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;code&gt;LEAD&lt;/code&gt; -is used to compare the current rows with the next row.Like comparing the total quantity of products purchased by a customer with the customer's next order.If there is no next value it returns NULL.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Example&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;&lt;code&gt;select order_id,order_date ,lead(order_date)OVER(order by order_date )as next_date,&lt;br&gt;
order_date-lead(order_date)OVER(order by order_date )&lt;br&gt;
from orders;&lt;/code&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;LAG&lt;/strong&gt; - is used to compare the current rows with the previous rows.Can be used to compare a customer's current order with the previous order.If there is no previous order it returns NULL.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;code&gt;select title,price,lag(price)over(order by price asc)as previous_price,&lt;br&gt;
price - lag(price)over(order by price asc)as price_diff  &lt;br&gt;
from books;&lt;/code&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;NTILE&lt;/strong&gt;
It is used to divide the results set into a specified number of roughly equal parts.Example in dividing orders into 4 parts based on quantity.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;Example&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fg0vwgztzns58r2m078ux.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fg0vwgztzns58r2m078ux.png" alt=" " width="800" height="328"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;RUNNING TOTAL.&lt;/strong&gt;
This is a cumulative sum that adds the current value to previous values based on a specified order.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fa7umtv4aqolr7xh9ydvz.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fa7umtv4aqolr7xh9ydvz.png" alt=" " width="800" height="378"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;POINT TO REMEMBER&lt;/strong&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;PARTITION BY.&lt;/strong&gt;
It is a part of the window function that divides data into groups before calculations.
It divides the results in groups just like GROUP BY but does not remove rows.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;HOW WINDOW FUNCTIONS PLAY A ROLE IN REAL WORLD DATA PROBLEMS&lt;/strong&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Improved query performance- data analysis involves using large data sets and window functions come in handy to ensure fast execution instead of using joins or subqueries.&lt;/li&gt;
&lt;li&gt;Preserve data integrity -Window functions allow data analysts to view individual results and aggregated results.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;Real word scenarios on the use of window functions.&lt;/strong&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Ranking products or customers&lt;/strong&gt;
When performing sales analysis for businesses a data analyst can use the windows function(&lt;code&gt;Rank,Dense_rank&lt;/code&gt;) to give ranks to products or customers based on their performance.This helps the business identify the performance of products and decide on which products or customers need more marketing.&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Calculating revenue growth&lt;/strong&gt;&lt;br&gt;
Using the running total a business is able to calculate cumulative revenue over time.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Identifying trends and performance.&lt;/strong&gt;&lt;br&gt;
Using the window functions businesses can compare the performance of their products, customers over time and guide them in making strategic decisions.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Compare current values with previous or next values.&lt;/strong&gt;&lt;br&gt;
Using &lt;code&gt;lead and lag&lt;/code&gt; a company can perform trends in sales by doing comparisons on either the previous or next sales.&lt;/p&gt;&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;Subqueries&lt;/strong&gt;&lt;br&gt;
A subquery is a query written inside another query (Nested).Subqueries can appearin the following instances;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;In the &lt;code&gt;select&lt;/code&gt; clause.&lt;/li&gt;
&lt;li&gt;In the &lt;code&gt;Where&lt;/code&gt; clause.&lt;/li&gt;
&lt;li&gt;In the &lt;code&gt;From&lt;/code&gt; clause.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;In the select clause&lt;/strong&gt;&lt;br&gt;
Subqueries in the select clause add an extra computed column to the results.&lt;br&gt;
&lt;strong&gt;Example&lt;/strong&gt;&lt;br&gt;
&lt;code&gt;select name,(select count(*) from projects p where p.employee_id = e.employee_id )as total_projects&lt;br&gt;
from employees_2 e;&lt;/code&gt;&lt;br&gt;
&lt;strong&gt;In the where clause&lt;/strong&gt;&lt;br&gt;
Subqueries in the where clause allow the main query to filter results based on another's query output.&lt;br&gt;
&lt;strong&gt;Example&lt;/strong&gt;&lt;br&gt;
&lt;code&gt;select d.department_name ,e.salary&lt;br&gt;
from employees_2 e &lt;br&gt;
join departments d on e.department_id = d.department_id &lt;br&gt;
where salary =(select max(salary)from employees_2 where e.department_id =d.department_id);&lt;/code&gt;&lt;br&gt;
&lt;strong&gt;In the From clause&lt;/strong&gt;&lt;br&gt;
A subquery in the from clause creates a temporary table from which data can be selected or filtered.&lt;br&gt;
&lt;strong&gt;Example&lt;/strong&gt;&lt;br&gt;
&lt;code&gt;select department_name,total_employees  &lt;br&gt;
from(select department_id ,count(*)as total_employees &lt;br&gt;
from employees_2 &lt;br&gt;
group by department_id) as department_summary &lt;br&gt;
join departments d &lt;br&gt;
on d.department_id = department_summary.department_id;&lt;/code&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Reasons for using subqueries&lt;/strong&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;They allow filtering data in the main query based on results of an inner query.&lt;/li&gt;
&lt;li&gt;Can be used in from clause for aggregation before proceeding with further analysis.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;How subqueries solve real world data problems&lt;/strong&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Aggregate data comparison&lt;/strong&gt;
Using subqueries a data analyst can calculate the average performance of products and customers while the main query can identify products that are below or above that average.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Filtering data from complex conditions&lt;/strong&gt;
Subqueries can be used to perform calculations that can be filtered in other SQL queries.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;Common Table Expressions(CTES)&lt;/strong&gt;&lt;br&gt;
CTES involves creating a temporary results set that only exists during the execution of the query.&lt;br&gt;
CTES breakdown complex queries into simpler more manageable and readable parts.&lt;br&gt;
CTES act like a virtual table which only exists during the execution of the main query.&lt;br&gt;
&lt;strong&gt;Example&lt;/strong&gt;&lt;br&gt;
Listing  employees earning more than 45000 using a cte&lt;/p&gt;

&lt;p&gt;&lt;code&gt;with high_earners as (select name,salary&lt;/code&gt; &lt;br&gt;
&lt;code&gt;from employees_2&lt;/code&gt; &lt;br&gt;
&lt;code&gt;where salary&lt;/code&gt; &lt;br&gt;
&lt;code&gt;&amp;gt;45000)&lt;/code&gt;&lt;br&gt;
&lt;code&gt;select * from high_earners;&lt;/code&gt;&lt;/p&gt;

&lt;p&gt;CTES can be used to simplify joins.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Example&lt;/strong&gt;&lt;br&gt;
&lt;code&gt;with total_orders   as (select customer_id,sum(quantity) as total_quantity from orders &lt;br&gt;
group by customer_id)&lt;br&gt;
select first_name,last_name ,t.total_quantity&lt;br&gt;
from total_orders t&lt;br&gt;
join clients c  on t.customer_id = c.customer_id;&lt;/code&gt;&lt;br&gt;
&lt;strong&gt;Multiple CTES&lt;/strong&gt; -More than one CTE can be used and they are separated by commas.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Example&lt;/strong&gt;&lt;br&gt;
&lt;code&gt;with department_max as (select department_id,max(salary)as max_salary&lt;br&gt;
from employees_2&lt;br&gt;
group by department_id),&lt;br&gt;
top_earners as (select name,employee_id,e.department_id,salary&lt;br&gt;
from employees_2 e &lt;br&gt;
join department_max  m on e.department_id =m.department_id &lt;br&gt;
and e.salary = m.max_salary)&lt;br&gt;
select * from top_earners;&lt;/code&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;How CTES solve real world data problems&lt;/strong&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Temporary nature-CTES results are not stored permanently in database, they disappear after a query is executed hence they don't need maintenance.&lt;/li&gt;
&lt;li&gt;Improving readability -CTES allow breaking down of queries into simple named steps instead of using nested unreadable queries.&lt;/li&gt;
&lt;li&gt;Reusability -CTES can be referenced several times hence no need of writing the same subquery multiple times.&lt;/li&gt;
&lt;li&gt;Performance optimization -complex nested queries can be turned into simple steps using CTES hence fast execution.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;Key differences between subqueries and CTES.&lt;/strong&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;CTES come before the main query while subqueries are inside the main query.&lt;/li&gt;
&lt;li&gt;CTES are easy to read however subqueries can be long and messy.&lt;/li&gt;
&lt;li&gt;CTES can be referenced several times while subqueries run once and cannot be reused.&lt;/li&gt;
&lt;li&gt;CTES are used when the query is complex and repititive while subqueries are used when you need quick nested results.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;Stored procedures&lt;/strong&gt;&lt;br&gt;
Stored procedure is a block of SQL logic stored inside the database.&lt;br&gt;
Actions that can be executed using CTES include&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Inserting data into tables.&lt;/li&gt;
&lt;li&gt;Updating data in tables.&lt;/li&gt;
&lt;li&gt;Delete data from tables.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;Example&lt;/strong&gt;&lt;br&gt;
This is a procedure that inserts data into the clients table.&lt;/p&gt;

&lt;p&gt;&lt;code&gt;create or replace procedure add_clients(&lt;br&gt;
p_first_name varchar,&lt;br&gt;
p_last_name varchar,&lt;br&gt;
p_email varchar,&lt;br&gt;
p_contact varchar,&lt;br&gt;
p_city varchar )&lt;/code&gt;&lt;/p&gt;

&lt;p&gt;&lt;code&gt;language plpgsql&lt;/code&gt; &lt;br&gt;
&lt;code&gt;as $$&lt;/code&gt; &lt;br&gt;
&lt;code&gt;begin&lt;/code&gt; &lt;br&gt;
    &lt;code&gt;insert into&lt;/code&gt;&lt;br&gt;
&lt;code&gt;clients(first_name,last_name,email,contact,city)&lt;/code&gt;&lt;br&gt;
    &lt;code&gt;values (p_first_name,&lt;/code&gt; &lt;code&gt;p_last_name,p_email,p_contact,p_city );&lt;/code&gt;&lt;br&gt;
&lt;code&gt;end;&lt;/code&gt;&lt;br&gt;
&lt;code&gt;$$;&lt;/code&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;How stored procedures solve real world data problems&lt;/strong&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Performance optimization -Using a call statement is faster than executing many lines of code.Because stored procedures are precompiled the database skips the compling steps hence faster execution.&lt;/li&gt;
&lt;li&gt;Stored procedures enhance data security-Data base administrators can allow users to execute the stored procedures instead of allowing them to directly use the &lt;code&gt;SELECT&lt;/code&gt; or &lt;code&gt;UPDATE&lt;/code&gt; procedures hence protecting the data from any malicious acts.&lt;/li&gt;
&lt;li&gt;Code reusability-Complex data procedures are written once and can be used across multiple executions.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Real world use case scenario.&lt;/strong&gt;

&lt;ul&gt;
&lt;li&gt;Banking -Transactions such as updating account balances,checking loan limits .&lt;/li&gt;
&lt;li&gt;Auditing - using stored procedures it is possible to track  changes in your data.
-Human resource department can use stored procedures to update staff records.&lt;/li&gt;
&lt;/ul&gt;


&lt;/li&gt;

&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;Indexes&lt;/strong&gt;&lt;br&gt;
Data analysts encounter complex and large data sets in their day to day jobs and querying this data sets can be slow and cumbersome. Indexes act as a road map for databases and allow the data base management system to quickly locate specific data rows without scanning the entire database .This ensures fast execution of queries.&lt;br&gt;
&lt;code&gt;create index idx_employees_2_name on employees_2(name);&lt;/code&gt;&lt;br&gt;
Partial  indexes indexes rows that meet a particular condition.&lt;br&gt;
Indexes can also be used on joins.&lt;br&gt;
create index idx_emp_dept on employees_2(department_id);&lt;br&gt;
&lt;code&gt;select e.name,d.department_name&lt;br&gt;
from employees_2 e  &lt;br&gt;
inner join departments d &lt;br&gt;
on e.department_id = d.department_id &lt;br&gt;
where d.department_name ='Engineering';&lt;/code&gt;&lt;br&gt;
&lt;strong&gt;How indexes solve data problems&lt;/strong&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Increases the speed of query execution- A search taking minutes can be reduced to seconds with indexes since the database only scans the specified rows instead of scanning the entire database.&lt;/li&gt;
&lt;li&gt;Efficient sorting and grouping.
Indexes store data in an sorted order which makes the &lt;code&gt;ORDER BY&lt;/code&gt;,&lt;code&gt;GROUP BY&lt;/code&gt; execution faster.&lt;/li&gt;
&lt;li&gt;Speed up join operations -Indexes speed up the process of merging rows.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;CONCLUSION&lt;/strong&gt;&lt;br&gt;
Data analysts face many data problems that hinder they ability to effectively analyse data .Some common data problems include poor quality data,data security  and bias ,dealing with large data sets.Complex SQL techniques enable data analysts to overcome this data problems and clearly analyse data.Indexing increases query optimization and reduces query execution time.Advanced SQL techniques enable data analysts to clean and transform data for effective analysis.It is important for every data analyst to master the advanced SQL techniques.&lt;/p&gt;

</description>
      <category>sql</category>
      <category>learning</category>
      <category>datascience</category>
      <category>beginners</category>
    </item>
    <item>
      <title>JOINS AND WINDOW FUNCTIONS IN SQL.</title>
      <dc:creator>maggy njuguna</dc:creator>
      <pubDate>Fri, 27 Feb 2026 19:21:32 +0000</pubDate>
      <link>https://forem.com/maggy_njuguna/joins-and-window-functions-in-sql-1pi</link>
      <guid>https://forem.com/maggy_njuguna/joins-and-window-functions-in-sql-1pi</guid>
      <description>&lt;p&gt;&lt;strong&gt;SQL&lt;/strong&gt; stands for structured Query Language and is the language used to communicate with a database.&lt;br&gt;
&lt;strong&gt;What is a database ?&lt;/strong&gt;&lt;br&gt;
An organized collection of data stored electronically.&lt;br&gt;
Data is stored in tables inside databases.&lt;br&gt;
SQL is used to retrieve data,update and analyse data.&lt;/p&gt;

&lt;h2&gt;
  
  
  JOINS IN SQL
&lt;/h2&gt;

&lt;p&gt;Joins are used in SQL to combine rows from two or more tables based on a common relationship(primary key)&lt;br&gt;
&lt;strong&gt;TYPES OF JOINS&lt;/strong&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;INNER JOIN&lt;/strong&gt;
It returns rows that are same from both tables.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;EXAMPLE&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fzu2995dzz4674mwpdz7a.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fzu2995dzz4674mwpdz7a.png" alt=" " width="800" height="110"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F9ylgouj8kt4qln45x6f4.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F9ylgouj8kt4qln45x6f4.png" alt=" " width="800" height="311"&gt;&lt;/a&gt;&lt;br&gt;
From the records this are the matching rows from both tables.&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;LEFT JOIN&lt;/strong&gt;
A Left join returns all rows from the left table and the matching rows from the right(second) table.
If there are no matching values,NULL values are returned from the right table.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;EXAMPLE&lt;/strong&gt;&lt;br&gt;
&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F8lv6g37or5pyxxvwpykj.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F8lv6g37or5pyxxvwpykj.png" alt=" " width="800" height="418"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;RIGHT JOIN&lt;/strong&gt;
A right join returns all rows from the right table and the matching rows from the left table.
If there is no match ,NULL values are returned for the columns from first(left) table.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;EXAMPLE&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fqw6uqw5jm48yx7hunky1.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fqw6uqw5jm48yx7hunky1.png" alt=" " width="800" height="345"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;em&gt;From the records only four rows in the name column are returned, some departments do not have a match hence the NULL values are returned after performing the query.&lt;/em&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Full join&lt;/strong&gt;
It combines the results of a left and right join.
A full join returns all rows from both tables, matching rows that meet conditions and includes unmatched rows too.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;EXAMPLE&lt;/strong&gt;&lt;br&gt;
&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fxqxs8375sgal2hhy5a4l.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fxqxs8375sgal2hhy5a4l.png" alt=" " width="800" height="445"&gt;&lt;/a&gt;&lt;br&gt;
&lt;strong&gt;NB&lt;/strong&gt; Some databases like MySQL,MariaDB and SQLite do not support full join and instead a UNION operator is used to join a left and right join.&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;SELF JOIN&lt;/strong&gt;
In this join operation a table is joined with itself.
It is used when you want to combine rows from the same table based on a related column.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fzwir4o6gqwm9f7sqfm5q.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fzwir4o6gqwm9f7sqfm5q.png" alt=" " width="800" height="344"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;em&gt;Joins are very important in SQL since they allow combining of data from different tables,they also reduce duplication by allowing data to be stored in separate tables and they also improve data flexibility.&lt;/em&gt;&lt;/p&gt;

&lt;h2&gt;
  
  
  WINDOW FUNCTIONS IN SQL
&lt;/h2&gt;

&lt;p&gt;Window functions perform calculations on related rows and keeps each row in the result unlike &lt;em&gt;GROUP BY&lt;/em&gt; which groups the rows together.&lt;br&gt;
A window function allows viewing of a group of rows.&lt;br&gt;
The &lt;code&gt;OVER()&lt;/code&gt; Clause is used in window functions.&lt;br&gt;
Examples of window functions &lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;&lt;code&gt;ROW_NUMBER()&lt;/code&gt;&lt;/li&gt;
&lt;li&gt;&lt;code&gt;RANK()&lt;/code&gt;&lt;/li&gt;
&lt;li&gt;&lt;code&gt;RUNNING TOTAL&lt;/code&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;code&gt;AVG() OVER()&lt;/code&gt;&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;ROW_NUMBER.&lt;/strong&gt;&lt;br&gt;
It assigns a unique number to each row based on a condition.&lt;br&gt;
In the case of 2 rows having similar values, this window function issues different numbers to the given rows.&lt;/p&gt;&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fzpfd35x0o56mz6gqcref.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fzpfd35x0o56mz6gqcref.png" alt=" " width="800" height="343"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;RANK.&lt;/strong&gt;
This window function issues a ranking number to each row based on the given conditions.
If two rows share the same value, they are assigned with the same value and the next number is skipped.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F4g6l6wvd4vb5m7lxraf9.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F4g6l6wvd4vb5m7lxraf9.png" alt=" " width="800" height="423"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;RUNNING TOTAL.&lt;/strong&gt;
This is a cumulative sum that adds the current value to previous values based on a specified order.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fa7umtv4aqolr7xh9ydvz.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fa7umtv4aqolr7xh9ydvz.png" alt=" " width="800" height="378"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;AVG() OVER()&lt;/strong&gt;
It calculates average without removing rows.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fsx04ro57uhzc1ceix394.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fsx04ro57uhzc1ceix394.png" alt=" " width="800" height="383"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;POINT TO REMEMBER&lt;/strong&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;PARTITION BY.&lt;/strong&gt;
It is a part of the window function that divides data into groups before calculations.
It divides the results in groups just like GROUP BY but does not remove rows.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;CONCLUSION&lt;/strong&gt;&lt;br&gt;
Window function and joins are very important in data analysis ,without which data analysis would be cumbersome.Joins allow you to combine data from different tables for effective analysis and window functions enable you to keep every detail of your data.&lt;/p&gt;

</description>
      <category>beginners</category>
      <category>database</category>
      <category>sql</category>
      <category>tutorial</category>
    </item>
    <item>
      <title>How Analysts Translate Messy Data, DAX, and Dashboards into Action Using Power BI.</title>
      <dc:creator>maggy njuguna</dc:creator>
      <pubDate>Sun, 08 Feb 2026 18:03:28 +0000</pubDate>
      <link>https://forem.com/maggy_njuguna/power-bis-superpower-in-cleaning-datadax-and-creating-real-time-dashboards-282e</link>
      <guid>https://forem.com/maggy_njuguna/power-bis-superpower-in-cleaning-datadax-and-creating-real-time-dashboards-282e</guid>
      <description>&lt;p&gt;&lt;strong&gt;Introduction&lt;/strong&gt;&lt;br&gt;
Power BI is one of the most powerful tools designed for cleaning data ,creating reports and dashboards.It is preferred over other tools due to its ability to handle large data sets,create real-time dashboards and its beginner friendly.Power BI desktop is a free version provided by Microsoft.&lt;br&gt;
&lt;strong&gt;Main uses of Power BI&lt;/strong&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Connecting data from different sources:web,excel,SQL.&lt;/li&gt;
&lt;li&gt;Cleaning and transforming data.&lt;/li&gt;
&lt;li&gt;Data modelling.&lt;/li&gt;
&lt;li&gt;Data analysis using DAX.&lt;/li&gt;
&lt;li&gt;Data visualisation:charts,filters,slicers.&lt;/li&gt;
&lt;li&gt;Reports and dashboard creation.&lt;/li&gt;
&lt;li&gt;Data automation, sharing and collaboration using Power BI service.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;Data cleaning in Power BI&lt;/strong&gt;&lt;br&gt;
Data analysts work with complex data sets and cleaning data goes beyond deleting rows and renaming column.Power Bi provides a powerful ETL(Extract, Transform, Load) engine that plays a key role in cleaning messy data to ensure accuracy,consistency and effective analysis.Power BI shows column Quality and distribution which helps analysts identify inconsistencies in their data.Power query is used to clean data in Power BI.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fvv7bc1annzby288u7pdm.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fvv7bc1annzby288u7pdm.png" alt=" " width="800" height="312"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Messy Data&lt;/strong&gt;&lt;br&gt;
Just like the name suggests messy data has this qualities&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Blank rows and missing values.&lt;/li&gt;
&lt;li&gt;Duplicate records.&lt;/li&gt;
&lt;li&gt;Wrong data types.&lt;/li&gt;
&lt;li&gt;Inconsistent text formats("kiambu","MOMBASA")&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;em&gt;Removing duplicates&lt;/em&gt;&lt;br&gt;
If your data must have unique values select the columns and remove duplicates.&lt;br&gt;
&lt;em&gt;Correct formatting&lt;/em&gt;&lt;br&gt;
Your data should be formatted correctly.&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Text: Employee Name, Country, Product Type.
&lt;/li&gt;
&lt;li&gt;Decimal Number: Market Price, Revenue, Gross profit.
&lt;/li&gt;
&lt;li&gt;Whole Number or Decimal: Farmer Code, depending on use.
&lt;/li&gt;
&lt;li&gt;Date: Planting Date, Harvest Date.
Use Transform &amp;gt; Detect Data Type to auto-detect, then adjust manually if needed. &lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;em&gt;Removing errors and blank values&lt;/em&gt;&lt;br&gt;
For numeric columns remove the errors but do not delete the blanks.&lt;br&gt;
For text columns consolidate the null,errors to "Not provided",use consistent labels for the entire data.&lt;br&gt;
Blank values can be replaced with 0 in numeric columns.&lt;br&gt;
Do not delete rows unless it is necessary.&lt;br&gt;&lt;br&gt;
use Replace Values to change "Error" → "Not provided".&lt;/p&gt;

&lt;p&gt;&lt;em&gt;Standardising text columns&lt;/em&gt; &lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Trim to remove extra spaces.&lt;/li&gt;
&lt;li&gt;Uppercase to capitalise each word.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;Impacts of a well cleaned data&lt;/strong&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Accurate analysis.
-Enhanced Power BI performance.&lt;/li&gt;
&lt;li&gt;Proper data modelling. 
After cleaning your data, Use Transform &amp;gt; Detect Data Type again.
Click Close &amp;amp; Apply to load your cleaned data into Power BI.&lt;/li&gt;
&lt;/ul&gt;

&lt;h2&gt;
  
  
  DAX in Power BI
&lt;/h2&gt;

&lt;p&gt;Sounds like programming,right?&lt;br&gt;
DAX stands for Data Analysis Expressions.&lt;br&gt;
DAX is a formula language used in Power BI to build measures,create calculated columns and calculated tables to transform raw data into meaningful insights.&lt;br&gt;
DAX mainly performs aggregation and iterator functions.&lt;br&gt;
&lt;strong&gt;Aggregation Functions&lt;/strong&gt;&lt;br&gt;
Aggregation is the process of combining rows of data into a single value.&lt;br&gt;
Common Aggregation functions.&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;SUM&lt;/strong&gt; -This adds up all values in a numeric column.
Example: Total Revenue 
Total Revenue = SUM('Kenya Crops'[Revenue (KES)]) &lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;AVERAGE&lt;/strong&gt;-This adds up the mean of a column.
Example: Average Market Price 
Average Market Price = AVERAGE('Kenya Crops'[Market Price (KES/Kg)]) &lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;MIN/MAX&lt;/strong&gt;- Calculates the largest and smallest values.
Example: Minimum Yield 
Minimum Yield = MIN('Kenya Crops'[Yield (Kg)]) 
Example: Highest Profit 
Maximum Profit = MAX('Kenya Crops'[Profit (KES)])&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;MEDIAN&lt;/strong&gt;
Returns the middle value of a column.
Example: Median Profit 
Median Profit = MEDIAN('Kenya Crops'[Profit (KES)]) &lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;COUNT&lt;/strong&gt; 
Counts numeric values in a single column. 
Example: Count Yield values 
Yield Count = COUNT('Kenya Crops'[Yield (Kg)]) &lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;Iterator functions&lt;/strong&gt;&lt;br&gt;
Performs row by row evaluation and gives a final result.&lt;br&gt;
&lt;strong&gt;Examples&lt;/strong&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;SUMX&lt;/strong&gt;
Evaluates an expression for each row in a table and then sums those results.
Total Profit (SUMX) = 
SUMX( 
'Kenya Crops', 
'Kenya Crops'[Revenue (KES)] - 'Kenya Crops'[Cost of Production (KES)])&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;AVERAGEX&lt;/strong&gt;
Evaluates an expression for each row and then returns the average of those results. 
Example 2: Average Profit per Acre 
Average Profit per Acre = 
AVERAGEX( 
'Kenya Crops', 
DIVIDE( 
'Kenya Crops'[Profit (KES)], 
'Kenya Crops'[Planted Area (Acres)])&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;COUNTX&lt;/strong&gt; 
COUNTX counts rows where an expression returns a non-blank value. 
Example 1: Count profitable farms 
Profitable Farms Count = 
COUNTX( 
'Kenya Crops', 
IF('Kenya Crops'[Profit (KES)] &amp;gt; 0, 1))&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;Logical Functions in Power BI&lt;/strong&gt;&lt;br&gt;
They are used to make decisions based on conditions.&lt;br&gt;
Examples IF,Nested IF,OR function, AND function,IF WITH&amp;amp;&amp;amp;.&lt;/p&gt;

&lt;p&gt;Example: Profit or Loss Classification (Measure) &lt;br&gt;
Profit Status =IF( SUM('Kenya Crops'[Profit (KES)]) &amp;gt; 0, &lt;br&gt;
"Profitable", "Loss"))&lt;/p&gt;

&lt;h2&gt;
  
  
  Dashboards in Power BI.
&lt;/h2&gt;

&lt;p&gt;A dashboard is a carefully designed collection of charts that allows businesses to answer specific questions easily and quickly.A dashboard contains a one page high level view of key factors that tell a story at a single glance.Main elements of a dashboard include&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;&lt;p&gt;KPI Cards (Revenue, Visits, Profit)&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;Trend lines (Sales over years)&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;Top / Bottom performers&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;Alerts (threshold-based)&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;Simple filters  and slicers(date, region)&lt;br&gt;
&lt;em&gt;To create a good dashboard avoid too many visuals ,complex calculations and very large tables.&lt;/em&gt;&lt;br&gt;
Users read dashboards from top to bottom and from left to right. The most important information should be placed at the top and towards the left. &lt;br&gt;
The top section of a dashboard should contain KPIs. These should be displayed using cards. &lt;br&gt;
visuals and should be immediately visible. &lt;br&gt;
The middle section should contain trend and performance visuals. &lt;br&gt;
The bottom section should contain supporting breakdowns and detailed comparisons. &lt;br&gt;
Filters should be placed where they are easy to find, at the top or along one side of the dashboard.&lt;br&gt;
&lt;strong&gt;Main Visuals used in a dashboard&lt;/strong&gt;&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Charts&lt;/strong&gt;&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Column chart&lt;/strong&gt;,A column chart displays data using vertical bars to compare values across categories.It is used to compare values across categories.&lt;/p&gt;&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fd5yzwedcp8uayzn94h5v.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fd5yzwedcp8uayzn94h5v.png" alt=" " width="800" height="451"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Bar chart&lt;/strong&gt; ,it is similar to a column chart but uses horizontal bars. &lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F0lk4o5nz80utw6r4t79w.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F0lk4o5nz80utw6r4t79w.png" alt=" " width="800" height="447"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Line chart&lt;/strong&gt; , line chart shows trends over time using connected data points.Example Gross profit over years.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fpv08kdaezxablbqd1rru.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fpv08kdaezxablbqd1rru.png" alt=" " width="800" height="456"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Pie charts and Donut chart&lt;/strong&gt;, these are best for showing percentage contribution of categories to a whole. 
Examples Revenue share by crop type.
Best used when there are few categories (2–5).&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F6ers9l4h90m9lo56xepn.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F6ers9l4h90m9lo56xepn.png" alt=" " width="800" height="453"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Card visual&lt;/strong&gt;, it displays a single aggregated value. 
Use cases: Total revenue  and Total profit.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F3fm90cxfyckdni5k2xdy.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F3fm90cxfyckdni5k2xdy.png" alt=" " width="781" height="387"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Matrix visual&lt;/strong&gt;, it is similar to a pivot table with rows and columns. 
Examples:  Revenue by county and seasons.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fkfjo87faymuu9woc2xkz.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fkfjo87faymuu9woc2xkz.png" alt=" " width="800" height="369"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Scatter chart&lt;/strong&gt; ,Shows relationship between two numeric variables. 
Examples:  Cost vs revenue, Yield vs profit.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fqdi2v2y5jy4z5hfj12po.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fqdi2v2y5jy4z5hfj12po.png" alt=" " width="800" height="475"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Map (Bubble map)&lt;/strong&gt;, it displays data using geographic locations with bubbles.Example Gross profit by Country.
&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fcmgc5u5mnlmdtrr3jqrp.png" alt=" " width="800" height="471"&gt;
&lt;strong&gt;Interactive dashboard&lt;/strong&gt;
&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fhurh4rdiu7mbu16fxviu.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fhurh4rdiu7mbu16fxviu.png" alt=" " width="800" height="436"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;h2&gt;
  
  
  Conclusion.
&lt;/h2&gt;

&lt;p&gt;Effective analysis,accurate and consistent data is a reflection of a well cleaned data.DAX allows data analysts to transform cleaned data and give meaningful insights.A well designed dashboard enables stakeholders to make effective decisions.&lt;/p&gt;

</description>
      <category>analytics</category>
      <category>beginners</category>
      <category>productivity</category>
    </item>
    <item>
      <title>SCHEMAS AND MODELLING IN POWER BI.</title>
      <dc:creator>maggy njuguna</dc:creator>
      <pubDate>Sun, 01 Feb 2026 15:00:00 +0000</pubDate>
      <link>https://forem.com/maggy_njuguna/schemas-and-modelling-in-power-bi-5c3</link>
      <guid>https://forem.com/maggy_njuguna/schemas-and-modelling-in-power-bi-5c3</guid>
      <description>&lt;h2&gt;
  
  
  Data Modelling .
&lt;/h2&gt;

&lt;p&gt;This process involves structuring,organizing and connecting data tables for effective performance.Designing a good data model is important since it makes reports accurate and easy to understand.To set up a data model in Power BI is done by opening the report view and go to Data Model.It involves bringing many data tables together and connecting the relationship between them.&lt;br&gt;
&lt;em&gt;A schema is a structure that shows how data is organized and related in a data model&lt;/em&gt;&lt;br&gt;
The two common data models are &lt;em&gt;Star Schema&lt;/em&gt; and &lt;em&gt;Snowflake&lt;/em&gt;&lt;/p&gt;

&lt;h2&gt;
  
  
  Star Schema
&lt;/h2&gt;

&lt;p&gt;This data model contains one central fact table and many dimension tables.The dimension tables are connected directly to the Facts table.&lt;br&gt;
The fact table is at the center sorrounded by the dimension tables forming the star shape.&lt;br&gt;
&lt;em&gt;Star schema is the data model mostly used in Power BI because it is simple to understand,executes DAX Functions effectively and its great for queries performance.&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fwohg1hvu6sbl643jqdzs.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fwohg1hvu6sbl643jqdzs.png" alt=" " width="800" height="505"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;h2&gt;
  
  
  Snow flake schema
&lt;/h2&gt;

&lt;p&gt;In this data model the Fact table is connected to dimension tables and the dimension tables can have a few dimension tables connected to them. A dimension table is connected indirectly to the fact table through another dimension table.&lt;br&gt;
&lt;em&gt;It is less preferred because of its complex nature, slow performance and its low speed in performing DAX functions.&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fteza56b1y61p4ick4tow.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fteza56b1y61p4ick4tow.png" alt=" " width="800" height="515"&gt;&lt;/a&gt;&lt;br&gt;
&lt;strong&gt;Key difference between Star schema and Snowflake Data models&lt;/strong&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Star schema contains a fact table sorrounded by dimension tables while snowflake contains a fact table connected to dimension table which are in turn connected to other dimension tables.&lt;/li&gt;
&lt;li&gt;In star schema a single join is used to connect the fact table to dimension table while in snowflake many joins are required.&lt;/li&gt;
&lt;/ul&gt;

&lt;h2&gt;
  
  
  Relationships in Data models
&lt;/h2&gt;

&lt;p&gt;This explains how tables are connected in a data model.&lt;/p&gt;

&lt;h2&gt;
  
  
  Types of relationships
&lt;/h2&gt;

&lt;ul&gt;
&lt;li&gt;1 to many(1:*)-One record in a dimension data relates to many records in a fact table.&lt;/li&gt;
&lt;li&gt;Many to 1(*:1)-Same as 1 to many but viewed in the oppositedirection.&lt;/li&gt;
&lt;li&gt;Many to many(:)-multiple records in one table match multiple records in another table.&lt;/li&gt;
&lt;/ul&gt;

&lt;h2&gt;
  
  
  Facts table
&lt;/h2&gt;

&lt;p&gt;This table stores the key values for your analysis.It stores measurable and quantitative values and the values are mostly numeric.&lt;br&gt;
&lt;em&gt;Example :Revenue,Gross profit,total sales,Quantity sold&lt;/em&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;A fact table contains many records,used for calculations(SUM,AVERAGE,MIN,MAX)&lt;/li&gt;
&lt;li&gt;A fact table has many rows and few columns.&lt;/li&gt;
&lt;/ul&gt;

&lt;h2&gt;
  
  
  Dimension tables.
&lt;/h2&gt;

&lt;p&gt;This tables act like a look up table for the values in the facts table.&lt;br&gt;
&lt;em&gt;Example:Employee name,Month,Country ,Region,Product category&lt;/em&gt;&lt;br&gt;
This tables contain textual data and are used for grouping and filtering data.&lt;br&gt;
They have many columns and fewer rows.&lt;/p&gt;

&lt;h2&gt;
  
  
  Why is good data modelling critical?
&lt;/h2&gt;

&lt;ul&gt;
&lt;li&gt;Good data modelling ensures data accuracy by reducing data ambiguity and duplication.&lt;/li&gt;
&lt;li&gt;A good data model improves performance by reducing the number of joins and relationships.Like in the case of star schema which has one fact table and a few dimension tables.&lt;/li&gt;
&lt;li&gt;Simplified analysis -a well designed data model makes it easy to perform accurate DAX calculations.&lt;/li&gt;
&lt;li&gt;A good data model ensures data integrity by enhancing accurate storage of data hence maintaining its integrity over time.&lt;/li&gt;
&lt;/ul&gt;

&lt;h2&gt;
  
  
  Conclusion.
&lt;/h2&gt;

&lt;p&gt;It is important for data analytics to understand data modelling since it plays a critical role in data analysis, reporting and decision making.A good data model improves performance and provides accurate data insights.&lt;/p&gt;

</description>
      <category>datascience</category>
      <category>beginners</category>
      <category>productivity</category>
    </item>
    <item>
      <title>Introduction to MS Excel for data analytics</title>
      <dc:creator>maggy njuguna</dc:creator>
      <pubDate>Sun, 25 Jan 2026 13:46:16 +0000</pubDate>
      <link>https://forem.com/maggy_njuguna/introduction-to-ms-excel-for-data-analytics-3569</link>
      <guid>https://forem.com/maggy_njuguna/introduction-to-ms-excel-for-data-analytics-3569</guid>
      <description>&lt;p&gt;Data Analysis is the process of analysing raw data to uncover hidden patterns, trends and actionable insights.Common tools used in data analysis include Excel,Power BI, SQL and Python.&lt;/p&gt;

&lt;h2&gt;
  
  
  Core components of Data Analysis.
&lt;/h2&gt;

&lt;p&gt;&lt;strong&gt;1. Data Collection&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;This   process involves  gathering data from different sources. Data can come from surveys, databases, spreadsheets, websites, sensors, or business systems. &lt;/p&gt;

&lt;p&gt;&lt;strong&gt;2. Data Cleaning and Preparation&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Raw data is often incomplete, duplicated, or inconsistent. Data cleaning involves removing errors, handling missing values, correcting formats, and ensuring consistency. &lt;/p&gt;

&lt;p&gt;&lt;strong&gt;3. Data Storage and Management&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Data must be stored in a structured and secure way so its retrieval can be easy. This may involve spreadsheets, databases, or data warehouses. This process ensures data is reliable and easy to retrieve.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;4. Data Analysis&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;This is the stage where data is examined to find patterns, trends, and relationships. It involves using statistical methods, formulas, and analytical tools.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;5. Data Visualization&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Data visualization uses charts, graphs, dashboards, and reports to present results clearly. Visuals make complex data easier to understand and help communicate insights to both technical and non-technical audiences.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;6. Interpretation and Insights&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;At this stage, the analyst explains what the results mean. Interpretation connects the analysis to real-world situations and business goals, turning numbers into meaningful insights.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;7. Decision Making&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;The final component involves using insights to support decisions. Data-driven decisions help organizations improve performance, reduce risks, and identify new opportunities.&lt;/p&gt;

&lt;h2&gt;
  
  
  How Microsoft Excel Can Be Used for Basic Data Analysis
&lt;/h2&gt;

&lt;p&gt;Microsoft Excel is a popular tool used for basic data analysis. It is easy to learn, widely available, and very powerful for organizing and understanding data.It provides simple features that help turn raw data into useful information without needing advanced skills.&lt;/p&gt;

&lt;h2&gt;
  
  
  Basic Data Cleaning in Excel.
&lt;/h2&gt;

&lt;p&gt;Data cleaning in Excel involves preparing raw data so it is accurate, consistent, and ready for analysis. Excel provides easy tools and functions that help beginners clean data efficiently.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Removing Duplicates&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Excel allows users to remove repeated records using the Remove Duplicates feature. This ensures that each record appears only once, improving data accuracy.&lt;br&gt;
&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fgp4yably0bb99401gw6t.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fgp4yably0bb99401gw6t.png" alt=" " width="800" height="450"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fr4yjk0r4b31c8tak7faq.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fr4yjk0r4b31c8tak7faq.png" alt=" " width="800" height="450"&gt;&lt;/a&gt;&lt;br&gt;
&lt;strong&gt;Correcting Data Formats&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Excel helps standardize formats such as dates, numbers, and text. For example, dates can be converted from text format to date format, and numbers can be corrected if they are stored as text.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fp6mff9jzbwjnr9enymmz.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fp6mff9jzbwjnr9enymmz.png" alt=" " width="800" height="450"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Frfmieizc2mqet0idnxu7.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Frfmieizc2mqet0idnxu7.png" alt=" " width="800" height="450"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F2tt6pijl4elt2dcts6qp.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F2tt6pijl4elt2dcts6qp.png" alt=" " width="800" height="450"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fkn4hujntp0b5stuy6rq4.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fkn4hujntp0b5stuy6rq4.png" alt=" " width="800" height="450"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Trimming Extra Spaces&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Extra spaces can cause errors in analysis. The TRIM function is used to remove unwanted spaces before and after text.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F7dp4ybd2z51aatmrw5wc.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F7dp4ybd2z51aatmrw5wc.png" alt=" " width="800" height="450"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fzf8xjvh04kk9x9fy5lsl.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fzf8xjvh04kk9x9fy5lsl.png" alt=" " width="800" height="450"&gt;&lt;/a&gt;&lt;br&gt;
&lt;strong&gt;Standardizing Text&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Excel helps make text consistent using functions such as:&lt;/p&gt;

&lt;p&gt;UPPER – converts text to uppercase&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F6fczfr0iqm32wp9ikpqm.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F6fczfr0iqm32wp9ikpqm.png" alt=" " width="800" height="450"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fez38bexzqrg6b4mzaehk.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fez38bexzqrg6b4mzaehk.png" alt=" " width="800" height="450"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;LOWER – converts text to lowercase&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Feap6w0ny92gl2acp1z18.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Feap6w0ny92gl2acp1z18.png" alt=" " width="800" height="450"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fw2iz6ja5bid8da7x2kla.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fw2iz6ja5bid8da7x2kla.png" alt=" " width="800" height="450"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;PROPER – capitalizes the first letter of each word&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F89tpimyxlv7svush7xwd.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F89tpimyxlv7svush7xwd.png" alt=" " width="800" height="450"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fiikkopfcz5tvr60fmmb0.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fiikkopfcz5tvr60fmmb0.png" alt=" " width="800" height="450"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Correcting Spelling Errors&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Excel’s Find and Replace feature is used to correct spelling mistakes or replace incorrect values across the dataset.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F4sb0pg1r9s11voekn8i8.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F4sb0pg1r9s11voekn8i8.png" alt=" " width="800" height="450"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Frlkw0sf3vpkcdv9e9tzh.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Frlkw0sf3vpkcdv9e9tzh.png" alt=" " width="800" height="450"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fwj63x0e04zb09th8c1k0.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fwj63x0e04zb09th8c1k0.png" alt=" " width="800" height="450"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Using Data Validation&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Data Validation restricts the type of data entered into a cell, reducing future errors. For example, limiting a column to accept only numbers or specific text values.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fcphr6s1gjiela2zjt2tv.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fcphr6s1gjiela2zjt2tv.png" alt=" " width="800" height="450"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fimdqynx1isi1v9iniosz.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fimdqynx1isi1v9iniosz.png" alt=" " width="800" height="450"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Data Sorting in Excel (Text, Number, Date)&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Data sorting in Microsoft Excel refers to the process of arranging data in a specific order to make it easier to read, analyse, and compare. Excel allows sorting of text, numbers, and dates in both ascending and descending order.&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Text sorting arranges data alphabetically from A to Z or Z to A.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fcdgjfpssi48r3q99k634.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fcdgjfpssi48r3q99k634.png" alt=" " width="800" height="450"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fi5rj2vuebetgdqdf7lij.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fi5rj2vuebetgdqdf7lij.png" alt=" " width="800" height="450"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Floc71m70peesz33fxmfz.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Floc71m70peesz33fxmfz.png" alt=" " width="800" height="450"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;&lt;p&gt;Number sorting organises values from Smallest to Largest or Largest to Smallest.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;Date sorting orders records from Oldest to Newest or Newest to Oldest.&lt;/p&gt;&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Sorting helps users quickly identify patterns, trends, within a dataset.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Filtering in Excel (Text, Number, Date)&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Filtering enables users to display only the data that meets specific criteria while temporarily hiding the rest.&lt;/p&gt;

&lt;p&gt;Text filters include options such as Equals, Contains, Begins With, and Ends With.&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;&lt;p&gt;Number filters allow filtering using conditions like Greater Than, Less Than, Between, and Top 10.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;Date filters include options such as Today, This Month, This Year, and Between.&lt;/p&gt;&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Filtering is essential for narrowing down large datasets to focus on relevant information without deleting any data.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fmpwci1mk82zzscmdqvgf.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fmpwci1mk82zzscmdqvgf.png" alt=" " width="800" height="450"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Faa6jwmnkgjbxmvojg8ss.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Faa6jwmnkgjbxmvojg8ss.png" alt=" " width="800" height="450"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;h2&gt;
  
  
  Functions in Excel
&lt;/h2&gt;

&lt;h3&gt;
  
  
  Arithmetic Functions
&lt;/h3&gt;

&lt;p&gt;Arithmetic functions perform basic mathematical calculations. Common arithmetic operations include addition, subtraction, multiplication, and division, which are used in formulas to compute values across cells.&lt;/p&gt;

&lt;h2&gt;
  
  
  Basic Data Analysis Tasks in Excel
&lt;/h2&gt;

&lt;p&gt;&lt;strong&gt;Calculating Average Salary&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Excel can be used to calculate the average salary of employees.&lt;/p&gt;

&lt;p&gt;Excel Formula:&lt;/p&gt;

&lt;p&gt;=AVERAGE(I2:I877)&lt;/p&gt;

&lt;p&gt;This helps HR understand the general salary level in the organization.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fpu36oat5a6tzyu2qhqxk.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fpu36oat5a6tzyu2qhqxk.png" alt=" " width="800" height="450"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fhzd4tkuuzn9pcu5fnwuv.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fhzd4tkuuzn9pcu5fnwuv.png" alt=" " width="800" height="450"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Counting Employees by Department&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Using COUNTIF, Excel can count how many employees work in each department.&lt;/p&gt;

&lt;p&gt;Example:&lt;/p&gt;

&lt;p&gt;=COUNTIF(I2:I877,"HR")&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Flcjymhlvsib94cdtt5wk.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Flcjymhlvsib94cdtt5wk.png" alt=" " width="800" height="450"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F2h8wheecbjob7pehipr6.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F2h8wheecbjob7pehipr6.png" alt=" " width="800" height="450"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;This helps HR see department distribution.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Finding Highest and Lowest Salary&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Excel functions help identify salary ranges.&lt;/p&gt;

&lt;p&gt;Formulas:&lt;/p&gt;

&lt;p&gt;=MAX(I2:I877)&lt;br&gt;
=MIN(I2:I877)&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F5cy4h5892tuq2q7dpc3r.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F5cy4h5892tuq2q7dpc3r.png" alt=" " width="800" height="450"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fuchfyvlfbhkc8b66q9dv.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fuchfyvlfbhkc8b66q9dv.png" alt=" " width="800" height="450"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F1vit1540d02s7k8fems5.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F1vit1540d02s7k8fems5.png" alt=" " width="800" height="450"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;This is useful for compensation analysis.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Analyzing Experience Levels&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Excel can calculate the average years of service:&lt;/p&gt;

&lt;p&gt;=AVERAGE(R22:R877)&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fk0gjinjypqbpwa3zrdqy.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fk0gjinjypqbpwa3zrdqy.png" alt=" " width="800" height="450"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fdsi4ae7fkdd6kj78ovrh.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fdsi4ae7fkdd6kj78ovrh.png" alt=" " width="800" height="450"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;This shows how experienced the workforce is&lt;/p&gt;

&lt;h3&gt;
  
  
  Logical Functions
&lt;/h3&gt;

&lt;p&gt;Logical functions evaluate conditions and return results based on whether those conditions are TRUE or FALSE.&lt;br&gt;
The most commonly used logical function is:&lt;/p&gt;

&lt;p&gt;IF – Returns one value if a condition is TRUE and another value if it is FALSE.&lt;br&gt;
Using the IF Function&lt;br&gt;
Example 1: Salary Category (Low or High)&lt;/p&gt;

&lt;p&gt;HR wants to know whether an employee’s salary is Low or High.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fpxysfdbzv4a05f0c8nzy.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fpxysfdbzv4a05f0c8nzy.png" alt=" " width="800" height="450"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fsg5u26ektj8mlc9a136t.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fsg5u26ektj8mlc9a136t.png" alt=" " width="800" height="450"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;h2&gt;
  
  
  Date Functions
&lt;/h2&gt;

&lt;p&gt;Date functions in Excel work with dates and time values. They are used to calculate durations, extract parts of a date, and perform date-based analysis, such as calculating the difference between dates or determining deadlines.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fg5fdz2lsqptmb1r5rcbe.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fg5fdz2lsqptmb1r5rcbe.png" alt=" " width="800" height="450"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Freybjhgjaek375ret0q2.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Freybjhgjaek375ret0q2.png" alt=" " width="800" height="450"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F4l2edalvq1nocm3lqlwy.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F4l2edalvq1nocm3lqlwy.png" alt=" " width="800" height="450"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fv03xnh0iwjzu2tw5zh2t.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fv03xnh0iwjzu2tw5zh2t.png" alt=" " width="800" height="450"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fo1hne8jtygxkt9we6njz.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fo1hne8jtygxkt9we6njz.png" alt=" " width="800" height="450"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F1xsz43u3i7nu2dnczdgn.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F1xsz43u3i7nu2dnczdgn.png" alt=" " width="800" height="450"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;h2&gt;
  
  
  PivotTables in Microsoft Excel
&lt;/h2&gt;

&lt;p&gt;A PivotTable is a data summarization tool in Microsoft Excel that automatically sorts, counts, totals, or averages data stored in a worksheet. It allows users to reorganize and group data dynamically without changing the original dataset.&lt;/p&gt;

&lt;h2&gt;
  
  
  PivotCharts in Microsoft Excel
&lt;/h2&gt;

&lt;p&gt;A PivotChart is a visual representation of data from a PivotTable. It updates automatically when the PivotTable changes, making it ideal for interactive analysis.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F0zaz8p5zhcp91n2yorbf.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F0zaz8p5zhcp91n2yorbf.png" alt=" " width="800" height="450"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;Types of Pivot Charts and Their Specific Uses&lt;br&gt;
Column Chart&lt;/p&gt;

&lt;p&gt;Use: Comparing values across categories&lt;/p&gt;

&lt;p&gt;Bar Chart&lt;/p&gt;

&lt;p&gt;Use: Comparing categories with long labels&lt;br&gt;
Best for: Performance comparisons&lt;/p&gt;

&lt;p&gt;Line Chart&lt;/p&gt;

&lt;p&gt;Use: Showing trends over time&lt;br&gt;
Best for: Monthly sales, yearly growth&lt;/p&gt;

&lt;p&gt;Pie Chart&lt;/p&gt;

&lt;p&gt;Use: Showing proportions of a whole&lt;br&gt;
Best for: Market share, percentage contribution&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F8w3hbgdfq0pz30la9z7c.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F8w3hbgdfq0pz30la9z7c.png" alt=" " width="800" height="450"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;h2&gt;
  
  
  Slicers in Microsoft Excel
&lt;/h2&gt;

&lt;p&gt;A Slicer is a visual filtering tool that allows users to filter PivotTables and PivotCharts interactively using buttons.&lt;/p&gt;

&lt;h3&gt;
  
  
  Connecting Slicers to Multiple PivotTables
&lt;/h3&gt;

&lt;p&gt;Connecting slicers allows one slicer to control multiple PivotTables and PivotCharts, ensuring consistent filtering across the dashboard.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fth1z7wj7m728w9pst9fe.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fth1z7wj7m728w9pst9fe.png" alt=" " width="800" height="450"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;h2&gt;
  
  
  Creating an Ineractive Excel Dashboard
&lt;/h2&gt;

&lt;p&gt;An Excel Dashboard is a visual summary that displays key performance indicators (KPIs), charts, and insights.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fw827nvce6nbk5vsyr84s.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fw827nvce6nbk5vsyr84s.png" alt=" " width="800" height="450"&gt;&lt;/a&gt;&lt;/p&gt;

</description>
      <category>webdev</category>
      <category>beginners</category>
      <category>learning</category>
      <category>datascience</category>
    </item>
    <item>
      <title>Understanding version control:How Git and GitHub play a vital role.</title>
      <dc:creator>maggy njuguna</dc:creator>
      <pubDate>Sat, 17 Jan 2026 12:32:20 +0000</pubDate>
      <link>https://forem.com/maggy_njuguna/understanding-version-controlhow-git-and-github-play-a-vital-role-3f1a</link>
      <guid>https://forem.com/maggy_njuguna/understanding-version-controlhow-git-and-github-play-a-vital-role-3f1a</guid>
      <description>&lt;h2&gt;
  
  
  Version control
&lt;/h2&gt;

&lt;p&gt;This is a system that enables tracking changes made to files over time.It enables collaboration between people working on same projects without interfering with each other's work.Git and GitHub are essential tools in version control.&lt;br&gt;
&lt;strong&gt;Git&lt;/strong&gt; A version control system used to track  changes made on files and also allows collaboration of teams dealing with same projects by recording every change made.&lt;br&gt;
&lt;strong&gt;Git Bash&lt;/strong&gt; It allows Windows users to fully utilise Git by acting as a command line.&lt;br&gt;
&lt;strong&gt;GitHub&lt;/strong&gt; An online platform that works with Git to manage and share code.It allows for many people to work on the same project without conflicts. It acts as a backup for projects being done and allows sharing projects with others.&lt;/p&gt;

&lt;h2&gt;
  
  
  Setting up git
&lt;/h2&gt;

&lt;p&gt;On Windows Git Bash is used as a command line for Git.The following commands are used to configure your user name and email.&lt;br&gt;
&lt;code&gt;git config --global user.name"name"&lt;/code&gt;&lt;br&gt;
&lt;code&gt;git config --global user.email"email address"&lt;/code&gt;&lt;/p&gt;

&lt;p&gt;&lt;em&gt;The process of creating a folder and adding it to Git&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Action&lt;/strong&gt;           &lt;strong&gt;Command&lt;/strong&gt;&lt;br&gt;
Create folder        &lt;code&gt;mkdir folder_name&lt;/code&gt;&lt;br&gt;
Enter folder         &lt;code&gt;cd folder_name&lt;/code&gt;&lt;br&gt;
Go back              &lt;code&gt;cd ..&lt;/code&gt;&lt;br&gt;
Create file          &lt;code&gt;touch file.txt&lt;/code&gt;&lt;br&gt;
Check location       &lt;code&gt;pwd&lt;/code&gt;&lt;br&gt;
Check git status     &lt;code&gt;git status&lt;/code&gt;&lt;br&gt;
Add to git           &lt;code&gt;git add folder_name&lt;/code&gt;&lt;br&gt;
Commit               &lt;code&gt;git commit -m "message"&lt;/code&gt;&lt;/p&gt;

&lt;h2&gt;
  
  
  Initialising Git
&lt;/h2&gt;

&lt;p&gt;Once you create a project folder initialise Git using this command&lt;br&gt;
&lt;code&gt;git init&lt;/code&gt;&lt;/p&gt;

&lt;h2&gt;
  
  
  Adding files to be tracked
&lt;/h2&gt;

&lt;p&gt;Use this command &lt;code&gt;git add filename&lt;/code&gt;&lt;/p&gt;

&lt;h2&gt;
  
  
  Pushing code to Git Hub
&lt;/h2&gt;

&lt;p&gt;This is the process of uploading changes made on your projects from your computer to an online platform including GitHub and Git Lab.It is important because it enables you to share your work with your teammates,it updates the project repository with the latest changes and also keeps a backup of your work online.&lt;/p&gt;

&lt;h3&gt;
  
  
  Commands to push code
&lt;/h3&gt;

&lt;ul&gt;
&lt;li&gt;Add files to staging area
This is choosing what changes of your code or project you want saved.It is done using this command&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;code&gt;git add filename&lt;/code&gt;&lt;br&gt;
&lt;code&gt;git add .&lt;/code&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Committing your changes
This involves saving a clear snapshot of your project for easy collaboration,tracking progress and avoiding mistakes.The following command is used&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;code&gt;git commit -m "a description of what you changed"&lt;/code&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Pushing changes to a remote repository
Involves sending changes saved(commits)from your computer to a remote repository (GitHub).Done using this command&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;code&gt;git remote add origin git@gitHub.com:username/repo-name.git&lt;/code&gt;&lt;/p&gt;

&lt;p&gt;&lt;em&gt;push your changes to GitHub&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;&lt;code&gt;git push -u origin main&lt;/code&gt;&lt;/p&gt;

&lt;h2&gt;
  
  
  Pulling code
&lt;/h2&gt;

&lt;p&gt;This is the process of downloading latest changes made on your project from an online repository to you computer(local).&lt;br&gt;
Done using this command&lt;/p&gt;

&lt;p&gt;&lt;code&gt;git pull&lt;/code&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Conclusion&lt;/strong&gt;&lt;br&gt;
Understanding version control system is very important for data scientists and analysts.Git and GitHub enable data analysts to effectively manage their projects and share their work with others.&lt;/p&gt;

</description>
      <category>webdev</category>
      <category>git</category>
    </item>
  </channel>
</rss>
