The Hard-Won Python Expert Checklist

Python is a joy when you’re writing a script.

But when you’re managing a CLI tool, scaling a monorepo, debugging memory leaks, and surviving async + multiprocessing… it bites.

This post is not a tutorial. It’s a battle log — things you only learn after deploying, failing, fixing, and repeating.

You’ll find what went wrong, why, and how to do it better.

📚 Table of Contents

1. Project Structure Is Half the Battle
2. Know the Standard Library Inside Out
3. Debug Like a Surgeon
4. Async Threads and Processes Pick the Right Tool
5. Be a Packaging Minimalist
6. Master Pythons Object Model
7. Testing–CLI-Go-Native-Before-You-Go-Fancy
8. Real Tips from Monorepo Hell
9. Pythonic-Isnt-Just-Style–Its-Predictability
10. Mutable Default Arguments Will Betray You
11. Dont-Use-__del__-for-Cleanup
12. Circular Imports Are Real
13. Never Swallow All Exceptions
14. Float Precision Lies
15. Use-__slots__-Only-for-Performance-Constrained-Code
16. Mock Responsibly
17. Optimize Import Time in CLI Tools
18. Multiprocessing Can Blow Up
Final Words What Makes You a Python Expert

1. 🧱 Project Structure Is Half the Battle

🔥 The Pain:

Deep folder nesting (src/tool/core/helpers/a/b/c.py)
Inconsistent imports across tools or notebooks
ModuleNotFoundError without clear cause

✅ Best Practices:

Use a flat package layout unless you’re building a library
Choose: either one pyproject.toml, or fully separate packages
Use glue modules to centralize imports
For CLI tools, avoid shared state between subtools

# glue_module.py
from core.engine.runner import Runner
__all__ = ["Runner"]

2. 🧰 Know the Standard Library Inside Out

✅ Why It Matters:

Every Python dev installs requests, pathlib, or click — even though 80% of the job can be done with the standard library.

📚 Essential stdlib modules to master:

File & path: pathlib, os, shutil, tempfile
Functional: functools, itertools, operator
Async/concurrency: asyncio, concurrent.futures, threading
Testing: unittest, doctest, warnings
Serialization: json, pickle, csv, struct
Debugging: logging, traceback, pdb
Introspection: inspect, sys, dis, gc

from functools import lru_cache

@lru_cache
def fib(n): return n if n < 2 else fib(n-1) + fib(n-2)

3. 🔍 Debug Like a Surgeon

💥 Common Failures:

Printing everywhere
Swallowing all exceptions
Blindly reading logs

✅ Expert Tools & Practices:

pdb.set_trace() for live inspection
logging.exception() captures trace + message
traceback.format_exc() for structured logging
tracemalloc to trace memory allocation
gc to find unreachable or leaked objects

import logging
try:
    some_func()
except Exception:
    logging.exception("Failed during execution")

4. 🧵 Async, Threads, and Processes: Pick the Right Tool

💥 Common Pitfalls:

Blocking the event loop with time.sleep() instead of await asyncio.sleep()
Using asyncio.run() inside another coroutine (raises RuntimeError)
Starting threads inside async code without synchronization
Misusing raw multiprocessing instead of a cleaner abstraction

✅ When to Use What:

Scenario	Tool	Why
Network I/O, APIs	`asyncio`	Lightweight coroutines, high throughput
CPU-bound tasks	`ProcessPoolExecutor`	Simplifies multicore usage
Blocking I/O (e.g. disk)	`ThreadPoolExecutor`	Avoids blocking the main thread

🧠 Use concurrent.futures for both threads and processes. Only reach for multiprocessing.Process if you need shared memory, custom IPC, or manual lifecycle control.

🔍 Code Examples

# Async I/O
import asyncio

async def fetch_data():
    await asyncio.sleep(1)
    return {"data": 42}

async def main():
    results = await asyncio.gather(*(fetch_data() for _ in range(5)))
    print(results)

asyncio.run(main())

# CPU-bound tasks
from concurrent.futures import ProcessPoolExecutor

def fib(n):
    if n < 2: return n
    return fib(n-1) + fib(n-2)

with ProcessPoolExecutor() as executor:
    results = list(executor.map(fib, [30, 31, 32]))
print(results)

# Threaded I/O
from concurrent.futures import ThreadPoolExecutor
import time

def read_file(name):
    time.sleep(1)
    return f"{name} read complete"

with ThreadPoolExecutor() as pool:
    results = list(pool.map(read_file, ["file1", "file2"]))
print(results)

5. 📦 Be a Packaging Minimalist

😫 Why Packaging Hurts So Many Devs

pip install -e . behaves differently than poetry install
requirements.txt becomes stale, pyproject.toml gets out of sync
You can’t tell which script owns which dependency
CLIs don’t register unless explicitly declared

✅ What to Actually Do

📌 Stick to One Tool

Use Poetry or hatch — don’t mix
Keep one pyproject.toml at the project root

🛠 Define Entry Points

[tool.poetry.scripts]
mytool = "my_package.cli:main"

🧪 For Development

poetry install --sync

🚀 For Production

python -m build
twine upload dist/*

Or install directly:

pip install dist/my_package-0.1.0-py3-none-any.whl

📋 Export Pin-Locked Requirements

poetry export -f requirements.txt --without-hashes > requirements.txt

6. 🐍 Master Python’s Object Model

Understanding Python’s object model turns you from a user of the language into a toolmaker.

Whether you’re designing APIs, optimizing memory, or building extensible systems — it all comes back to the object model.

🔎 Why You Need to Care

🧠 Python treats everything as an object — yes, even functions and modules.
🧊 You can optimize memory via __slots__.
🧰 You can define contracts using abc.ABC.
🧼 You can hook into dynamic behavior with __getattr__, __setattr__, or __getattribute__.
⚙️ You can build your own DSLs or plugins with metaclasses or descriptors.

🧪 Attribute Interception

Want to defer loading, inject behaviors, or proxy access?

class LazyLoader:
    def __getattr__(self, name):
        print(f"[LazyLoad] Loading {name}")
        return f"value_for_{name}"

ll = LazyLoader()
print(ll.db)  # → Loading db → value_for_db

This is how many frameworks implement “virtual” fields, config resolution, or lazy bindings.

🧊 Save Memory with slots

class Point:
    __slots__ = ("x", "y")  # Prevents __dict__ allocation

    def __init__(self, x, y):
        self.x = x
        self.y = y

Benefits:

50–70% memory savings when creating millions of objects
Faster attribute access (no dict lookup)
Prevents typos from creating unexpected attributes

Drawbacks:

No dynamic attributes
Limited compatibility with some tools (dataclasses, pickle)

🔐 Abstract Base Classes (ABCs)

Use abc when you want to define formal interfaces:

from abc import ABC, abstractmethod

class Engine(ABC):
    @abstractmethod
    def start(self): ...

class GasEngine(Engine):
    def start(self):
        return "Vroom"

g = GasEngine()
print(g.start())  # ✅ Works

If you forget to implement start(), Python will raise an error at class construction time — not during runtime. That’s safer.

🧬 Dynamically Create Classes

This powers things like plugin systems, serializers, or even custom ORM models:

MyType = type("MyType", (object,), {"x": 42})
print(MyType().x)  # → 42

Yes, type is also a class — and this is what metaclasses build upon.

🧙 Bonus: Context Managers with Dunder Methods

class FileWriter:
    def __enter__(self):
        self.file = open("log.txt", "w")
        return self.file
    def __exit__(self, *exc):
        self.file.close()

with FileWriter() as f:
    f.write("Hello world")

This is how Python’s with block works — and why open(...) is so elegant.

🧠 TL;DR: Python’s object model is not “advanced” — it’s foundational. It’s the key to building elegant, maintainable, powerful code.

7. 🧪 Testing & CLI? Go Native Before You Go Fancy

import argparse

def main():
    parser = argparse.ArgumentParser()
    parser.add_argument("--dry-run", action="store_true")
    args = parser.parse_args()

import unittest

class MathTest(unittest.TestCase):
    def test_add(self):
        self.assertEqual(1 + 1, 2)

💡 Use argparse, unittest, and doctest by default. Reach for click, pytest, or typer only when needed.

8. 🔥 Real Tips from Monorepo Hell

Monorepos sound great — until you hit circular imports, ambiguous entry points, or dependency chaos.

😱 Common Issues:

Deep, fragile import paths like from myproj.foo.bar.baz.qux import Thing
Scripts that only work if run from project root
Mixing Poetry, pip, conda, PYTHONPATH hacks…
Circular imports from overconnected modules

✅ Best Practices:

1. 📦 Keep Your Layout Flat

Avoid over-nesting like src/tool/core/internal/engine/runner.py unless you must. Prefer:

tool/
  api.py
  cli.py
  core/
    logic.py
    runner.py

2. 🪞 Use Glue Modules to Flatten Imports

Avoid importing using full module paths. Create re-export modules near the root:

# shared/api.py
from .core.logic import fetch_data
from .core.runner import Runner

__all__ = ["fetch_data", "Runner"]

Then everywhere else:

from shared.api import fetch_data

Easy to change structure later — no need to rewrite dozens of imports.

3. ✅ One pyproject.toml, One Install

Install your tool like a real package — even during dev:

poetry install --sync

Don’t run uninstalled scripts via python path/to/script.py. That’s fragile.

4. 🚫 Never Use `PYTHONPATH`

It hides real import errors.
It breaks in containers, CI, and deployment tools.
Use relative imports inside packages and install your package locally.

5. 📦 Use `src/` Layout Only If You’re Publishing

If you’re building a library:

mytool/
  pyproject.toml
  src/
    mytool/
      __init__.py
      engine.py
      utils.py

This avoids accidentally importing from the working directory and enforces clean packaging.

💡 Your import paths reflect your architecture. If they look deep and ugly, your structure probably is too.

9. ✨ Pythonic Isn’t Just Style — It’s Predictability

Being “Pythonic” isn’t about flair — it’s about clarity, composability, and surprise-free code.

✅ Key Idioms:

# EAFP (easier to ask forgiveness)
try:
    value = config["key"]
except KeyError:
    value = default

# Clean and readable list comprehension
squares = [x * x for x in range(10)]

# Conditional comprehension — ✅ only when it's simple
signs = ["even" if x % 2 == 0 else "odd" for x in range(5)]

# ❌ Hard to read: nested conditions or logic
results = [func(x) for x in data if x > 0 and not is_skipped(x) and (x % 3 == 0 or x in special_set)]

# ✅ Better readability with for-loop
results = []
for x in data:
    if x > 0 and not is_skipped(x) and (x % 3 == 0 or x in special_set):
        results.append(func(x))

# zip + enumerate
for i, (a, b) in enumerate(zip(list1, list2)):
    ...

# Avoid this
funcs = [lambda x: x + i for i in range(3)]  # 💥 late binding bug

# Do this
funcs = [lambda x, i=i: x + i for i in range(3)]

Readable code is maintainable code. Pythonic is what feels natural to another Python dev.

10. 💣 Mutable Default Arguments Will Betray You

😱 The Hidden Trap

In Python, default arguments are evaluated once — at function definition time, not each time the function is called.

This leads to a shared mutable object across calls.

def append(item, items=[]):  # BAD
    items.append(item)
    return items

print(append("a"))  # ['a']
print(append("b"))  # ['a', 'b'] ← Unexpected!

Instead of starting fresh every time, you’re modifying the same list.

✅ The Idiomatic Fix

def append(item, items=None):
    if items is None:
        items = []
    items.append(item)
    return items

Now it works as expected:

print(append("a"))  # ['a']
print(append("b"))  # ['b']

💡 For the Type-Safe Folks

If you’re using type hints:

from typing import Optional, List, Any

def append(item: Any, items: Optional[List[Any]] = None) -> List[Any]:
    if items is None:
        items = []
    items.append(item)
    return items

🧠 When Might You Intentionally Use It?

Rarely, but if you want shared state:

def cache(value, _cache={}):
    if value not in _cache:
        _cache[value] = expensive_compute(value)
    return _cache[value]

Just comment it clearly — because it’s often misunderstood.

⚠️ TL;DR: Never use mutable defaults unless you’re doing it on purpose and you’re absolutely sure it’s safe.

11. ⛔ Don’t Use `del` for Cleanup

__del__ is not reliable for releasing resources — especially with cyclic references or abrupt exits.

😬 What Can Go Wrong:

It may not get called at all
It can resurrect objects by mistake
It’s called during interpreter shutdown (when globals may be gone)

✅ Use Context Managers Instead:

class FileWriter:
    def __enter__(self):
        self.file = open("out.txt", "w")
        return self.file
    def __exit__(self, exc_type, exc_val, exc_tb):
        self.file.close()

with FileWriter() as f:
    f.write("Hello!")

Or use contextlib:

from contextlib import contextmanager

@contextmanager
def open_writer(path):
    f = open(path, "w")
    try:
        yield f
    finally:
        f.close()

12. 🔄 Circular Imports Are Real

They creep in silently and break your app at runtime.

Example:

# a.py
from b import foo
def bar(): pass

# b.py
from a import bar  # 💥 bar not defined yet
def foo(): pass

✅ Fix: Move Shared Logic

Refactor to a common module:

# shared.py
def bar(): ...
def foo(): ...

Circular imports are a design smell — your modules are too interdependent.

13. ☠️ Never Swallow All Exceptions

try:
    risky()
except:
    pass  # BAD

except Exception:
    logging.exception("Something failed")

14. 📏 Float Precision Lies

IEEE 754 is the root cause. Python follows it faithfully — and that causes surprises.

😱 Surprise:

>>> 0.1 + 0.2
0.30000000000000004

✅ Use decimal for Financials:

from decimal import Decimal
Decimal("0.1") + Decimal("0.2") == Decimal("0.3")  # True

✅ Use math.isclose() for Scientific Work:

import math
math.isclose(0.1 + 0.2, 0.3, rel_tol=1e-9)  # ✅

15. 🧊 Use `slots` Only for Performance-Constrained Code

🧠 What It Does:

Removes__dict__, saving memory
Makes attribute access faster (like a C struct)

✅ When to Use:

You’re creating millions of small objects
The class has fixed fields only

class Node:
    __slots__ = ("name", "value")

⚠️ Drawbacks:

Can’t add new attributes
Can’t mix with regular classes unless careful
Doesn’t work with dataclass unless you use @dataclass(slots=True) (Python 3.10+)

🔍 Profile before using __slots__. It’s a micro-optimization that can backfire in dynamic apps.

16. 🧪 Mock Responsibly

Mocking lets you isolate your code under test — but bad mocking makes tests worse than useless.

😱 The Mess:

# tests/test_logic.py
@patch("utils.fetch_data")
def test_foo(mock_fetch):
    ...

Problems:

❌ Too much patching turns your test into a simulation of reality — not reality itself.
❌ Global patching leaks state between tests, leading to unpredictable behavior.
❌ Patching the wrong path (utils.fetch_data vs module_under_test.fetch_data) leads to “mock not applied” bugs.

✅ Good Mocking Practices

🔒 1. Patch Where the Function is Used, Not Where It’s Defined

# module_a.py
from utils import fetch_data

def run(): return fetch_data()

# tests/test_module_a.py
@patch("module_a.fetch_data")  # ✅ patch where it's used
def test_run(mock_fetch):
    mock_fetch.return_value = {"ok": True}
    assert run() == {"ok": True}

💡 If you patch utils.fetch_data, it won’t replace the already-imported reference in module_a.

🎯 2. Use with `patch()` to Control Scope

from unittest.mock import patch

def test_logic():
    with patch("module.logic.fetch_data") as fake:
        fake.return_value = {"status": "ok"}
        result = logic()
        assert result == ...

✅ Limits patching to one test
✅ Automatically restores the original function

🧪 3. Prefer Fixtures or Monkeypatching in pytest

# conftest.py
import pytest

@pytest.fixture
def fake_fetch(monkeypatch):
    monkeypatch.setattr("module.api.fetch_data", lambda: {"fake": True})

# test_logic.py
def test_using_fixture(fake_fetch):
    result = logic()
    assert result["fake"] is True

✅ Fixtures make setup/teardown explicit
✅ Easier to reuse across tests

🚫 4. Don’t Over-Mock Internal Logic

Instead of mocking everything, consider using integration-style tests that hit real code paths:

# Instead of mocking add():
def test_total_price():
    cart = Cart()
    cart.add(Product("apple", 1.0))
    assert cart.total() == 1.0

Mocks are great for external APIs, I/O, or long-running calls — not your own logic.

🧼 5. Always Clean Up

If you’re writing custom patches:

original = module.fetch
module.fetch = fake_func
...
module.fetch = original  # ⚠️ Risky and easy to forget

Just don’t. Use patch() or monkeypatch to avoid leaking state.

🧠 Good mocks don’t hide problems — they reveal structure. If mocking a function feels painful, your design probably needs refactoring.

17. 🐢 Optimize Import Time in CLI Tools

🐌 Problem:

import pandas as pd  # takes 300ms+

This runs even for –help.

✅ Defer Import:

def main():
    import pandas as pd
    ...

✅ Lazy Load with importlib:

def load_plugin(name):
    import importlib
    return importlib.import_module("plugins." + name)

⚡ Your CLI should feel snappy. A slow startup discourages usage and testing.

18. 🧨 Multiprocessing Can Blow Up

Multiprocessing gives you real parallelism by leveraging multiple CPU cores — but it’s notoriously finicky.

💥 What Goes Wrong:

Forking processes that inherit active threads → 💣 deadlocks or crashes
Child processes failing silently (no logs, no traceback)
Shared state gets duplicated, not shared
On macOS: default fork behavior is unsafe (especially with GUI, NumPy, etc.)

✅ Best Practices:

1. Use `concurrent.futures.ProcessPoolExecutor`

It wraps multiprocessing with sane defaults and cleaner API:

from concurrent.futures import ProcessPoolExecutor

def work(n): return n * n

with ProcessPoolExecutor() as pool:
    print(list(pool.map(work, range(4))))

2. Use the `spawn` Start Method (Especially on macOS or in Jupyter)

import multiprocessing as mp

ctx = mp.get_context("spawn")
with ctx.Pool() as pool:
    print(pool.map(work, range(4)))

"spawn" creates a clean new Python process (safe, slow)
"fork" (default on Linux) copies current process memory (fast, dangerous)

💡 On macOS or PyTorch workflows, always use "spawn" to avoid mysterious crashes.

3. Avoid Mixing Threads + Fork

import threading, multiprocessing

def run():
    print("In thread")

t = threading.Thread(target=run)
t.start()

# 💥 This can hang or segfault!
multiprocessing.Process(target=some_func).start()

If you really need both, start processes first.

Multiprocessing is powerful — but you must tame it. Debugging zombie processes or race conditions in forked workers is not for the faint of heart.

🔚 Final Words: What Makes You a Python Expert

It’s not about syntax or speed — it’s about:

Knowing how Python fails
Writing code that survives production
Debugging like a surgeon
Packaging like a product engineer

📚 Table of Contents

1. 🧱 Project Structure Is Half the Battle

🔥 The Pain:

✅ Best Practices:

2. 🧰 Know the Standard Library Inside Out

✅ Why It Matters:

📚 Essential stdlib modules to master:

3. 🔍 Debug Like a Surgeon

💥 Common Failures:

✅ Expert Tools & Practices:

4. 🧵 Async, Threads, and Processes: Pick the Right Tool

💥 Common Pitfalls:

✅ When to Use What:

🔍 Code Examples

5. 📦 Be a Packaging Minimalist

😫 Why Packaging Hurts So Many Devs

✅ What to Actually Do

📌 Stick to One Tool

🛠 Define Entry Points

🧪 For Development

🚀 For Production

📋 Export Pin-Locked Requirements

6. 🐍 Master Python’s Object Model

🔎 Why You Need to Care

🧪 Attribute Interception

🧊 Save Memory with slots

Benefits:

Drawbacks:

🔐 Abstract Base Classes (ABCs)

🧬 Dynamically Create Classes

🧙 Bonus: Context Managers with Dunder Methods

7. 🧪 Testing & CLI? Go Native Before You Go Fancy

8. 🔥 Real Tips from Monorepo Hell

😱 Common Issues:

✅ Best Practices:

1. 📦 Keep Your Layout Flat

2. 🪞 Use Glue Modules to Flatten Imports

3. ✅ One pyproject.toml, One Install

4. 🚫 Never Use PYTHONPATH

5. 📦 Use src/ Layout Only If You’re Publishing

9. ✨ Pythonic Isn’t Just Style — It’s Predictability

✅ Key Idioms:

10. 💣 Mutable Default Arguments Will Betray You

😱 The Hidden Trap

✅ The Idiomatic Fix

💡 For the Type-Safe Folks

🧠 When Might You Intentionally Use It?

11. ⛔ Don’t Use __del__ for Cleanup

😬 What Can Go Wrong:

✅ Use Context Managers Instead:

12. 🔄 Circular Imports Are Real

Example:

✅ Fix: Move Shared Logic

13. ☠️ Never Swallow All Exceptions

14. 📏 Float Precision Lies

😱 Surprise:

✅ Use decimal for Financials:

✅ Use math.isclose() for Scientific Work:

15. 🧊 Use __slots__ Only for Performance-Constrained Code

🧠 What It Does:

✅ When to Use:

⚠️ Drawbacks:

16. 🧪 Mock Responsibly

😱 The Mess:

✅ Good Mocking Practices

🔒 1. Patch Where the Function is Used, Not Where It’s Defined

🎯 2. Use with patch() to Control Scope

🧪 3. Prefer Fixtures or Monkeypatching in pytest

🚫 4. Don’t Over-Mock Internal Logic

🧼 5. Always Clean Up

17. 🐢 Optimize Import Time in CLI Tools

🐌 Problem:

✅ Defer Import:

✅ Lazy Load with importlib:

18. 🧨 Multiprocessing Can Blow Up

💥 What Goes Wrong:

✅ Best Practices:

1. Use concurrent.futures.ProcessPoolExecutor

2. Use the spawn Start Method (Especially on macOS or in Jupyter)

3. Avoid Mixing Threads + Fork

4. 🚫 Never Use `PYTHONPATH`

5. 📦 Use `src/` Layout Only If You’re Publishing

11. ⛔ Don’t Use `del` for Cleanup

15. 🧊 Use `slots` Only for Performance-Constrained Code

🎯 2. Use with `patch()` to Control Scope

1. Use `concurrent.futures.ProcessPoolExecutor`

2. Use the `spawn` Start Method (Especially on macOS or in Jupyter)