← All articles
CareerLearningInfrastructure

The Best Way to Learn Infrastructure Engineering

18 March 2026 · 10 min

The Uncomfortable Truth About Learning Infrastructure

Most people trying to break into infrastructure engineering are doing it wrong. Not through any fault of their own — the ecosystem actively encourages the wrong behaviours. YouTube channels rack up millions of views. Certification prep courses promise job-readiness in twelve weeks. Boot camps sell the dream of a cloud role in three months.

None of it works as advertised. Not because the content is bad, but because passive consumption of information is a remarkably poor way to build the kind of judgement that infrastructure work actually demands.

Infrastructure engineering is not a knowledge domain. It is a reasoning domain. The job is not to recall what a TCP handshake looks like — it is to diagnose why connections are being dropped at 2 a.m. on a Tuesday when three things changed in the last deploy. That requires a different kind of learning entirely.

Why Passive Learning Fails

Hermann Ebbinghaus mapped the forgetting curve in the 1880s, and nothing about it has improved since. Without active reinforcement, you will forget roughly 50% of what you read or watch within a day, and close to 90% within a week. Watching a four-hour Kubernetes tutorial on a Sunday afternoon is not learning — it is the illusion of learning, which is arguably worse, because it creates false confidence.

The problem is compounded by the nature of infrastructure knowledge itself. A firewall rule is not a fact you memorise; it is a consequence of understanding network flow, trust zones, and stateful inspection. If you memorise the rule without the model, you cannot adapt it when the topology changes. You are left cargo-culting configuration from Stack Overflow and hoping nothing breaks.

Passive learning gives you facts without models. The moment reality diverges from the tutorial — which it always does — you are lost.

Spaced Repetition Is Not Optional

The antidote to the forgetting curve is spaced repetition: revisiting material at increasing intervals, forcing your brain to reconstruct the knowledge rather than simply recognise it. It is not glamorous, and it does not feel like progress in the way that finishing a video course does. But the evidence is unambiguous.

For infrastructure specifically, spaced repetition works best when it is scenario-driven rather than flashcard-driven. Instead of asking yourself "what port does SSH use?", ask yourself "a user cannot SSH into a host — walk through every layer you would check". The second question exercises the mental model. The first exercises trivia.

This is why ShellQuest's daily practice is built around scenarios rather than definitions. Five minutes of active recall in the morning beats an hour of passive revision at the weekend.

The Scenario-First Approach

The single most effective way to accelerate infra learning is to spend the majority of your time inside problems, not reading about them. An incident scenario — even a simulated one — forces you to:

  • Form a hypothesis about what is broken
  • Design a test to confirm or rule it out
  • Interpret the output of that test
  • Update your mental model based on what you find

This is the diagnostic loop that senior engineers run automatically. It is a skill, not a trait, and it is built by repetition under realistic conditions.

The best simulated environment is one that behaves like a real system: processes misbehave, logs are noisy, and the obvious answer is usually wrong. If you want to practise this kind of thinking today, the ShellQuest diagnostic is designed exactly for this — realistic Linux and networking faults.

Build Something. Break It.

No lab environment will fully replace running real hardware and software yourself. A homelab does not have to be expensive — a secondhand SFF machine for £60, a Raspberry Pi cluster, or even a handful of virtual machines on your laptop will do. The goal is not to replicate production; it is to give yourself a sandbox where consequences are real enough to matter but recoverable enough that you experiment freely.

Here is a minimal starting point:

# Spin up VMs with Vagrant and VirtualBox — one as a router, two as nodes
vagrant init ubuntu/jammy64
# Edit Vagrantfile to define private_network addresses
vagrant up
# Now practise: misconfigure iptables, watch traffic drop, fix it
iptables -A INPUT -p tcp --dport 22 -j DROP
# SSH into the VM from the host — what happens? How do you recover?

Breaking things intentionally and recovering from them is worth ten hours of watching someone else do it. The anxiety of "I have locked myself out" is the feeling of a mental model forming.

Read Postmortems. Read Them Seriously.

Other people's production incidents are one of the most underused learning resources in the industry. Google's SRE book appendix, Cloudflare's incident blog, AWS's service health history — these are detailed accounts of exactly the kind of reasoning you are trying to build.

When you read a postmortem, do not skim to the "root cause" section. Follow the timeline. At each step, pause and ask yourself what you would have checked next. Then read what the team actually checked. The gap between your hypothesis and theirs is your learning target.

Learn Things in the Right Order

One of the most common mistakes is jumping to the interesting-sounding technologies before the foundations are solid. Kubernetes is not beginner material. Neither is Terraform. Both make far more sense once you understand what they are abstracting.

StageFocusWhy it comes first
1 — CoreLinux, shell, networking fundamentalsEverything runs on Linux; every system communicates over a network
2 — AutomationBash scripting, Ansible, basic PythonYou cannot automate what you do not understand manually
3 — Scale / SREKubernetes, observability, incident responseMeaningful only once you know what you are orchestrating
4 — SpecialisationSecurity, data engineering, platform engBuild on solid foundations, not before them

The Linux track and networking track cover stages one and two in depth. If you are not confident with ss, tcpdump, systemctl, and journalctl, spend time there before touching a cloud platform. The concepts library fills in the gaps — things like TLS termination, NAT traversal, and cgroup resource limits that courses often skip because they are not photogenic.

Make It Stick: Daily Practice and Weak-Area Review

Consistency beats intensity. Thirty minutes a day, every day, will compound faster than a weekend binge once a month. The reason is simple: sleep consolidates memory. Spacing your practice across multiple days means your brain has more opportunities to move material into long-term storage.

More importantly, track what you get wrong. Most learners avoid their weak areas because revisiting them feels bad. This is precisely backwards. The concepts you consistently fumble are the ones delivering the highest return when you finally nail them.

Instead of thisDo this
Watching a full Linux course from scratchPractise specific weak areas daily
Grinding certification dumpsWork through real scenario problems
Reading documentation end-to-endRead docs when you hit a real problem
Starting with KubernetesStart with Linux process management
Hoping lab experience transfersDeliberately debrief each lab session

The Mindset Shift

The engineers who progress fastest are not the ones who know the most facts. They are the ones who are most comfortable saying "I do not know what is happening yet" and then systematically working through the problem until they do. That comfort comes from having practised the diagnostic loop enough times that it feels natural.


If you are serious about building real infra skills, start with ShellQuest's daily practice — five to ten minutes of scenario-based exercises designed to build the reasoning habits that matter. Not sure where you stand? The diagnostic will tell you honestly. And if you want structured depth across Linux, networking, and the SRE discipline, join the waitlist for full access to the curriculum as it expands. No hype. Just deliberate practice.

Liked this? ShellQuest turns these mental models into puzzles and labs you can actually practise.

Join the waitlist