Spaces:
Sleeping
Sleeping
metadata
title: 5. Data Preparation
original_url: https://tds.s-anand.net/#/data-preparation?id=data-preparation
downloaded_at: '2025-06-08T23:22:16.649843'
Data Preparation
Data preparation is crucial because raw data is rarely perfect.
It often contains errors, inconsistencies, or missing values. For example, marks data may have ‘NA’ or ‘absent’ for non-attendees, which you need to handle.
This section teaches you how to clean up data, convert it to different formats, aggregate it if required, and get a feel for the data before you analyze.
Here are links used in the video:
- Presentation used in the video
- Scraping assembly elections - Notebook
- Assembly election results (CSV)
pdftotextsoftware- OpenRefine software
- The most persistent party
- TN assembly election cartogram
[Previous
Scraping: Live Sessions](#/scraping-live-sessions)
[Next
Data Cleansing in Excel](#/data-cleansing-in-excel)
