In this project, i am cleaning raw data using MySQL and it turn out that this is realy easy to use more often if you are getting to know about the data it self.
Read More
As a data analyst, one of the most critical steps before performing any analysis is data cleaning. No matter how sophisticated your dashboard or model is, if the data is messy, the outcome will likely be misleading.
In many of my projects—especially those involving MySQL as the main data source—data cleaning serves as a foundational task. Below, I’ll walk through several real-world data cleaning techniques I’ve applied using SQL queries in MySQL.
SELECT * FROM sales WHERE customer_name IS NULL;
Once identified, missing values can be:
DELETE FROM sales
WHERE id NOT IN (
SELECT MIN(id) FROM (SELECT * FROM sales) AS temp
GROUP BY order_id
);
I often use MIN(id) to retain one original row while eliminating duplicate records based on a unique column like order_id.
UPDATE customers SET customer_name = TRIM(customer_name);
TRIM() is a simple but powerful function to ensure clean strings, especially useful for preventing errors during JOIN operations.
SELECT STR_TO_DATE(order_date, '%d/%m/%Y') AS formatted_date FROM orders;
Using STR_TO_DATE, I make sure all date formats are consistent so functions like YEAR() or MONTH() work flawlessly.
UPDATE orders
SET ship_mode = 'Standard Class'
WHERE ship_mode IN ('standard', 'std', 'STD');
Normalizing category values avoids data being split into several variants that should actually be grouped together.
In one of my sales analysis projects, I discovered that over 15% of the data contained duplicates or inconsistent labels. After cleaning, the analysis results became much more reliable, helping the business make smarter, data-driven decisions.
Data cleaning is not just about technical skills—it's about understanding context. With MySQL, I can clean, normalize, and prepare raw data efficiently to ensure quality insights.
If you're working with data, never skip this step. Because in analytics, quality > quantity.
Lorem ipsum dolor, sit amet consectetur adipisicing elit. Tenetur adipisci facilis cupiditate recusandae aperiam temporibus corporis itaque quis facere, numquam, ad culpa deserunt sint dolorem autem obcaecati, ipsam mollitia hic.
In this project, i am cleaning raw data using MySQL and it turn out that this is realy easy to use more often if you are getting to know about the data it self.
Read More
Excel provide me more easy because at the end of the day, excel came up with huge feature and it make it more compatible since we are know the tools.
Read More
Lorem, ipsum dolor sit amet consectetur adipisicing elit. Eius libero soluta impedit eligendi? Quibusdam, laudantium.
Read More