Data Engineer's Guide to Linux: Why It Is Your Secret Weapon
If you are stepping into the world of data engineering, you will quickly realize that if SQL and Python are the languages spoken, then Linux is the room they are spoken in. Most modern infrastructu...

Source: DEV Community
If you are stepping into the world of data engineering, you will quickly realize that if SQL and Python are the languages spoken, then Linux is the room they are spoken in. Most modern infrastructure, from cloud servers to Docker containers run on Linux. This read is here to walk you through why Linux matters and help you manage the flow of data by navigating the terminal. Data engineering involves processing large amounts of data. While Windows and Mac are great for local development, they carry bloat. (GUI updates & background updates which enable smooth rendering of desktop icons). Since Linux is headless, it means that 100% of RAM & CPU goes towards processing queries & scripts, not rendering desktop icons. Other advantages of working with Linux include:- It gives you granular control over memory and CPU which is important when working with terabytes of data Automation. It has crontab which is the Linux scheduler. Here you can script repetitive tasks like moving files o