Extract. Transform. Read.A newsletter from Pipeline Hi past, present or future data professional! I dreaded entering the job market after my data science master's. I felt like I knew more than a data analyst but less than a professional data scientist. I've since realized my program was more effective than I thought, but it couldn't prepare me for the key areas like cloud deployments and real-world problem-solving I had to learn on the job as a data engineer. And I’ve noticed these gaps in many other data science curricula, not just my own. My hope is that this can help manage your expectations if you're considering a data science MS, identify areas where you need to upskill. Most importantly, I want to reassure you no program can teach you everything. Gaps In The Cloud My entire grad school experience was constrained to a local environment. My coding was done in Jupyter Notebook VMs, but the focus was on the code itself, not the infrastructure it ran on. The only kind of development process I knew of was GitHub, and even that wasn’t entirely professional, because we each had our own individual repos. This left me unprepared for cloud deployments. I had no idea how to provision a virtual machine or orchestrate a simple pipeline. The good news is that you can quickly close this gap. Both GCP and AWS offer free tiers and resources. At a minimum, you should learn how to provision a VM, orchestrate simple pipelines (like with Airflow), schedule processes, and visualize data from a data warehouse. Beyond Basic SQL While my program introduced database fundamentals, the emphasis on SQL was lacking. We used SQLite and PostgreSQL, but never really dove into the intermediate or advanced concepts. I was largely self-taught in SQL, and it was my least confident skill when I started my job. The reality is that production-level work requires you to write and debug large, complex queries. My advice is to focus on problem-solving over memorizing functions. Learn to write bloated queries to understand what not to do, and efficient ones to understand optimization. From "Mad Libs" to Real-World Requirements My biggest critique of school coding courses is the "color-by-number" approach, where you’re just filling in blanks or, more accurately, Jupyter Notebook cells. While this is great for learning syntax, it completely misses the point of coding, which is the process of creative problem-solving. School projects often provide a perfectly defined problem, but in the real world, you'll be translating vague business requirements into a technical solution. To prepare, you can simulate this process on your personal projects. Before you write a single line of code, ask yourself: What data do I need? Who is the end user? How will I accomplish this? When does it need to be done? And most importantly, why am I using this method? Answering these questions will distinguish you in job interviews and give you a huge head start. Before You Go… If you’re still unsure of how to make your projects more professional, I just finished (writing, not reading!) a new ebook I plan to release within the next month. Its goal is to lend structure to the project development process so you can create work that makes you proud and hiring managers salivate. Join the waitlist to get exclusive early access. Thanks for ingesting, -Zach Quinn |
Reaching 20k+ readers on Medium and nearly 3k learners by email, I draw on my 4 years of experience as a Senior Data Engineer to demystify data science, cloud and programming concepts while sharing job hunt strategies so you can land and excel in data-driven roles. Subscribe for 500 words of actionable advice every Thursday.
Extract. Transform. Read. A newsletter from Pipeline Hi past, present or future data professional! If you live in the U.S., this week marks the end of back to school season; though, if you’re like my southern relatives, you’ve been back since July. The closest feeling most adults get to back to school (aside from the teachers), is starting a new job. While a new org, title and compensation package represents new opportunities, it’s also easy to feel like the “new kid”, which can lead to being...
Extract. Transform. Read. A newsletter from Pipeline Hi past, present or future data professional! I once participated in a remote job interview in which the interviewer was on the video call while driving... and smoking. While that instance was among the most memorable interview experiences (for the wrong reasons), I’ve had just as many interviews that have blended together and faded into the recesses of my mind. The common denominator, however, was the insistence on asking one question. The...
Extract. Transform. Read. A newsletter from Pipeline Hi past, present or future data professional! One of the most loaded terms, after AI, is upskilling. It’s something everyone should always be doing, yet, only the most dedicated can consistently dedicate time to learning and expanding beyond their comfort zones. If you’re on the path to becoming a data professional, you’ve probably spent countless hours learning, only to find yourself wondering if you’re actually making progress. I’ve been...