metadata
title: Data Preparation in the Shell
original_url: >-
https://tds.s-anand.net/#/data-preparation-in-the-shell?id=data-preparation-in-the-shell
downloaded_at: '2025-06-08T23:26:41.381829'
Data Preparation in the Shell
You’ll learn how to use UNIX tools to process and clean data, covering:
- `curl` (or `wget`) to fetch data from websites.
- `gzip` (or `xz`) to compress and decompress files.
- `wc` to count lines, words, and characters in text.
- `head` and `tail` to get the start and end of files.
- `cut` to extract specific columns from text.
- `uniq` to de-duplicate lines.
- `sort` to sort lines.
- `grep` to filter lines containing specific text.
- `sed` to search and replace text.
- `awk` for more complex text processing.
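To get a feel for how these tools combine, here is a minimal sketch that runs most of them on a tiny CSV. The file name `visits.csv`, its columns, and its rows are made up for illustration and do not come from the video. The file is written locally with a heredoc so the script works without network access; in practice the first step would be a `curl` or `wget` download.

```shell
#!/bin/sh
# Hypothetical sample data standing in for a downloaded file.
# A real workflow would start with something like: curl -O <url-of-your-file>
workdir=$(mktemp -d)
cat > "$workdir/visits.csv" <<'EOF'
date,city,visits
2024-01-01,Chennai,10
2024-01-01,Mumbai,7
2024-01-02,Chennai,10
2024-01-02,Delhi,3
2024-01-02,Delhi,3
EOF

# gzip replaces the file with visits.csv.gz; gzip -dc decompresses to stdout.
gzip "$workdir/visits.csv"
gzip -dc "$workdir/visits.csv.gz" > "$workdir/data.csv"

# wc -l counts lines; $(( )) strips any padding some wc versions add.
line_count=$(( $(wc -l < "$workdir/data.csv") ))

# head and tail show the first and last lines.
header=$(head -n 1 "$workdir/data.csv")
last_row=$(tail -n 1 "$workdir/data.csv")

# tail -n +2 skips the header, cut keeps column 2, sort | uniq de-duplicates.
unique_cities=$(tail -n +2 "$workdir/data.csv" | cut -d, -f2 | sort | uniq)

# grep -c counts the lines that contain the given text.
delhi_rows=$(grep -c 'Delhi' "$workdir/data.csv")

# sed rewrites text; here only the first matching row is kept for display.
renamed=$(grep 'Delhi' "$workdir/data.csv" | sed 's/Delhi/New Delhi/' | head -n 1)

# awk sums column 3 over the data rows (NR > 1 skips the header).
total_visits=$(awk -F, 'NR > 1 { total += $3 } END { print total }' "$workdir/data.csv")

echo "lines: $line_count"
echo "header: $header"
echo "last row: $last_row"
echo "cities:"
echo "$unique_cities"
echo "rows mentioning Delhi: $delhi_rows"
echo "renamed row: $renamed"
echo "total visits: $total_visits"

rm -r "$workdir"
```

Note that `uniq` only collapses adjacent duplicate lines, which is why `sort` runs before it in the pipeline.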
Here are the links used in the video:
