Notes

A personal knowledge management (PKM) system for professional work in data science, machine learning, and AI. Built in Obsidian.


Purpose

This vault is a structured, long-lived knowledge base — not a note dump. It is organized by epistemic layer and abstraction level: each layer answers a distinct question, from why does this work mathematically? through to why are we solving this business problem?

The goal is to accumulate knowledge that compounds over time: timeless theory in foundations, analytical practice in data science, reusable model knowledge in modeling, proven engineering patterns in the engineering layers, domain use cases in applications, and concrete implementations in reference implementations.


Structure

notes/
├── 00_meta/                        # Vault governance: conventions, templates, dashboard, glossary
├── 01_foundations/                 # Timeless mathematical and theoretical foundations
├── 02_data_science/                # Analytical reasoning: from data to decisions
├── 03_modeling/                    # Model families, training, and model selection
├── 04_software_engineering/        # Programming languages, system design, tooling
├── 05_ml_engineering/              # ML production lifecycle: data, training, deployment, monitoring
├── 06_ai_engineering/              # Foundation model systems: RAG, fine-tuning, inference, LLMOps
├── 07_applications/                # Domain use cases and business application patterns
├── 08_implementations/   # Cross-layer executable patterns: system patterns and end-to-end examples (concept-specific impl lives with the concept)
├── 09_projects/                    # Time-bound execution instances
│   ├── _active/
│   ├── _completed/
│   └── _experimental/
├── 10_reading/                     # Structured intake: papers, books, articles, reports
├── 11_logs/                        # Operational memory: daily notes, reviews, brain dumps
└── 99_archive/                     # Retired content

Layer Guide

LayerGuiding QuestionExamples
01_foundationsWhy does this work mathematically?Calculus, linear algebra, backpropagation theory
02_data_scienceHow do we reason from data to decisions?EDA, feature engineering, experiment design, interpretability
03_modelingWhich model should we use and why?GLMs, neural networks, transformers, evaluation, regularization
04_software_engineeringHow do we build robust software?Python, Go, TypeScript, APIs, databases, testing, DevOps
05_ml_engineeringHow do we productionize ML systems?Feature stores, MLflow, drift detection, serving
06_ai_engineeringHow do we operate LLM systems?RAG, LoRA fine-tuning, quantization, LangSmith
07_applicationsWhy are we solving this problem?Fraud detection, churn prediction, document intelligence
08_implementationsHow is this cross-cutting executable pattern implemented?Reusable system patterns, end-to-end reference architectures (concept-specific impl → home layer)
09_projectsWhat are we executing right now?Active work, experiments
10_readingWhat have we learned from external sources?Paper notes, book summaries
11_logsWhat happened today / this week?Daily notes, reflections

Note Types

Notes follow templates matched to their layer (see 00_meta/templates/):

TemplateLayerStructure
tpl_foundation.md01_foundationsStatement → Intuition → Mathematical Formulation → Assumptions → Consequences
tpl_proof.md01_foundationsStatement → Assumptions → Proof → Notes
tpl_data_science.md02_data_scienceProblem Context → Analytical Goal → Data Considerations → Method → Validation
tpl_model.md03_modelingCore Idea → Math Formulation → Inductive Bias → Strengths/Weaknesses → Variants
tpl_software_note.md04_software_engineeringPurpose → Core Concepts → Design Notes → Trade-offs → Implementation Notes
tpl_ml_system.md05_ml_engineeringPurpose → Architecture → Data/Feature Flow → Operational Considerations → Trade-offs
tpl_ai_system.md06_ai_engineeringUse Case → Model Strategy → Evaluation → Guardrails → Architecture → Cost/Latency
tpl_application.md07_applicationsProblem → Stakeholders → Domain Context → Inputs/Outputs → Modeling Options → Risks
tpl_reference_implementation.md08_implementationsGoal → Conceptual Counterpart → Dependencies → Code Pattern → Practical Notes
tpl_end_to_end_example.md08_implementationsGoal → Problem Setup → Data → Design → Implementation → Evaluation → Extensions
tpl_project_overview.md09_projectsGoal → Scope → Deliverables → Data → Modeling → Engineering → Timeline
tpl_reading_note.md10_readingSource → Type → Why It Matters → Key Ideas → Relevance to Vault

All notes carry frontmatter: layer, type, status (seed / growing / stable), tags, created.


Conventions

Full conventions are in 00_meta/conventions.md. Key rules:

  • File names: lowercase with underscores (chain_rule.md, not ChainRule.md)
  • Top-level structure is stable — changes require updating conventions.md
  • No duplicate concepts — if overlap exists, merge
  • Cross-linking: modeling notes link to foundations; application notes link to modeling and engineering; reference implementations link to their conceptual counterpart
  • Knowledge flows: Reading → Logs → Projects → Core layers (stable)
  • Tags: cross-cutting metadata only; never restate the folder path (see §12 of conventions)

Getting Started