Building Truth Tables

A function has 32 boolean parameters.

How many test cases do you need to exhaustively verify its behavior?

The answer is 2³² — 4,294,967,296. Four billion, two hundred ninety-four million test cases. At one test per microsecond, that is 71 minutes of pure execution time. Before assertions. Before test framework overhead. Before any I/O.

No real test suite does this. No real test suite can. This is not a gap in practice — it is a mathematical fact about the nature of boolean space. With n boolean inputs, there are 2ⁿ possible combinations. The space doubles with each additional variable.

This is where truth tables meet computational reality. A truth table is the specification of a boolean function. It lists every input combination and the required output. For small n, it is finite and buildable. For large n, it is infinite in practice. The tension between specification completeness and computational feasibility is one of the central problems of software verification.

The 32-parameter function is not hypothetical. Legacy codebases are full of functions with a dozen flag parameters, each of which is a boolean. The combination space is untestable. The bugs hide in the corners.

Thread Activation

Thread 12 (Tradeoffs) enters here — the tension between exponential completeness and practical feasibility.

Boolean composition (Chapter 2) opens a combination space. This chapter quantifies that space: n variables produce 2ⁿ rows. Chapter 17 extends this into asymptotic complexity — the formal language for measuring how costs scale. The arrow is:

T12 (Tradeoffs) ← boolean composition (Ch 2) → exponential blowup (Ch 3) → measuring efficiency (Ch 17)

The exponential growth of truth tables is the simplest concrete example of exponential complexity in this series. Every algorithm designer who avoids exponential-time solutions is, at some level, avoiding the combinatorial explosion first visible in truth tables.

The Concept

A truth table is a systematic enumeration of all possible truth value assignments to a set of variables, together with the resulting truth value of a logical expression under each assignment.

Truth tables serve two purposes:

Definition — they fully define what a logical operator or expression means. No ambiguity is possible. The table says what the output is for every input.
Verification — given two expressions claimed to be equivalent, you build both tables and compare them column by column. If they match on every row, the expressions are equivalent (Chapter 4).

The visual model: a truth table is a switch-and-circuit diagram flattened into a table. Every row is one configuration of all switches. The output column shows whether current flows (True) or not (False) through the circuit for that configuration.

How It Works

Constructing a Truth Table

Step 1: Identify the variables.

Count how many distinct propositional variables appear in the expression. Call this n.

Step 2: Determine the number of rows.

The table has 2ⁿ rows (plus a header row). One row per unique assignment of True/False to all n variables.

Step 3: Fill in the variable columns.

Assign truth values systematically. For the last variable, alternate T/F. For the second-to-last, alternate in pairs (TT/FF). For the third-to-last, alternate in groups of four (TTTT/FFFF). The pattern for variable k (counting from the right, starting at 0) is: repeat True for 2ᵏ rows, then False for 2ᵏ rows, alternating with each doubling.

Step 4: Evaluate sub-expressions.

For complex expressions, add intermediate columns. Evaluate one operator at a time, left to right, following precedence.

Step 5: Fill in the output column.

The final column is the truth value of the full expression for each row.

Example: One Variable

Expression: NOT P

P	NOT P
True	False
False	True

Two rows. The output is the inverse of the input.

Example: Two Variables

Expression: P AND Q

P	Q	P AND Q
True	True	True
True	False	False
False	True	False
False	False	False

Four rows. AND is True only on the first row — when both P and Q are True.

Example: Three Variables with Sub-expressions

Expression: (P AND Q) OR (NOT P AND R)

Three variables: P, Q, R. Eight rows.

P	Q	R	P AND Q	NOT P	NOT P AND R	(P AND Q) OR (NOT P AND R)
True	True	True	True	False	False	True
True	True	False	True	False	False	True
True	False	True	False	False	False	False
True	False	False	False	False	False	False
False	True	True	False	True	True	True
False	True	False	False	True	False	False
False	False	True	False	True	True	True
False	False	False	False	True	False	False

The intermediate columns make each step auditable. You can verify any single cell without reading the whole table.

Counting Rows: Why 2ⁿ

Each variable is independent. It can be True or False. Two choices. n variables, each with two choices, give 2 × 2 × … × 2 (n times) = 2ⁿ combinations.

This is the fundamental counting principle applied to boolean variables. It is also the same logic that gives you 2ⁿ possible subsets of an n-element set (each element is either in or out — two choices, n decisions).

n	2ⁿ	Practical assessment
1	2	Trivial
2	4	Trivial
3	8	A couple minutes by hand
4	16	A short table
8	256	Manageable by program
16	65,536	Fast for a computer
20	1,048,576	One million — still fast
32	4.3×10⁹	Four billion — minutes of compute
64	1.8×10¹⁹	Cosmological — beyond any exhaustion
2²⁶⁵	10⁸⁰	Roughly the number of atoms in the universe

The growth is not linear. It is not polynomial. It is exponential. Each variable added doubles the table size. At n=32 you are already in “impractical to exhaust” territory for most testing frameworks.

Truth Tables as Boolean Function Specifications

A truth table is the canonical specification of a boolean function. Given a truth table, you can construct a boolean expression that produces exactly those outputs. The procedure is called Disjunctive Normal Form (DNF) construction:

For every row where the output is True, write a conjunction (AND) of all input variables (negated if the variable is False on that row).
Combine all these conjunctions with disjunctions (OR).

def build_dnf_expression(truth_table: list[dict]) -> str:
    """
    Build a DNF expression string from a truth table.
    truth_table: list of rows, each row is a dict with variable names as keys
    and 'output' key for the result.
    """
    terms = []
    variables = [k for k in truth_table[0] if k != 'output']

    for row in truth_table:
        if not row['output']:
            continue  # DNF only uses rows where output is True

        # Build a minterm: conjunction of all variables (negated if False)
        literals = []
        for var in variables:
            if row[var]:
                literals.append(var)          # positive literal
            else:
                literals.append(f"NOT {var}")  # negated literal

        term = " AND ".join(literals)
        terms.append(f"({term})")

    return " OR ".join(terms) if terms else "False"


# Example: XOR truth table
xor_table = [
    {'P': True,  'Q': True,  'output': False},
    {'P': True,  'Q': False, 'output': True},
    {'P': False, 'Q': True,  'output': True},
    {'P': False, 'Q': False, 'output': False},
]

print(build_dnf_expression(xor_table))
# Output: (P AND NOT Q) OR (NOT P AND Q)

The output — (P AND NOT Q) OR (NOT P AND Q) — is a valid boolean expression for XOR. It is derived mechanically from the truth table without any insight into what XOR “means.” This is the power of the DNF construction: given any desired behavior (any truth table), you can construct an expression for it.

This also proves that AND, OR, NOT are functionally complete: they can express any boolean function. Any truth table. Any behavior.

Truth Tables as Tests

A truth table is also a test suite. Each row is a test case: given these inputs, the output must be this value.

import itertools

def verify_boolean_function(func, expected_table: dict) -> bool:
    """
    Verify a boolean function against its truth table.
    expected_table: dict mapping (input tuples) to expected outputs.
    Returns True if all rows match, raises AssertionError otherwise.
    """
    for inputs, expected_output in expected_table.items():
        actual_output = func(*inputs)
        assert actual_output == expected_output, (
            f"Truth table violation: f{inputs} = "
            f"{actual_output}, expected {expected_output}"
        )
    return True


# Truth table for a rate-limit decision function
# Inputs: (under_limit, is_premium, is_admin)
# Output: should the request be allowed?
rate_limit_truth_table = {
    # under limit, premium, admin → allow
    (True,  True,  True):  True,
    (True,  True,  False): True,   # under limit, premium → allow
    (True,  False, True):  True,   # under limit, admin → allow
    (True,  False, False): True,   # under limit → allow
    # over limit, premium, admin → allow (premium override)
    (False, True,  True):  True,
    # over limit, premium → allow (premium override)
    (False, True,  False): True,
    # over limit, admin → allow (admin override)
    (False, False, True):  True,
    (False, False, False): False,  # over limit, no overrides → block
}

def should_allow_request(
    under_limit: bool,
    is_premium: bool,
    is_admin: bool,
) -> bool:
    # Truth-table-derived implementation
    return under_limit or is_premium or is_admin

verify_boolean_function(should_allow_request, rate_limit_truth_table)

The truth table is written first (the specification). The function is written to match it. The test calls verify_boolean_function to confirm they agree. This is specification-driven development for boolean functions. It is not a methodology. It is the natural consequence of having a complete, finite specification.

The Combinatorial Test Coverage Problem

For 3 variables, 8 rows, and 8 test cases, full coverage is trivial. For 10 variables, 1024 rows, full coverage is feasible. For 20 variables, one million rows — possible but slow. For 32 variables, 4 billion rows — impractical.

The real-world analog is any function with multiple boolean flags. Consider a function that processes an HTTP request with these boolean flags:

def process_request(
    is_authenticated: bool,
    has_read_permission: bool,
    has_write_permission: bool,
    is_rate_limited: bool,
    request_is_valid: bool,
    session_is_active: bool,
    resource_exists: bool,
    content_type_valid: bool,
) -> Response:
    ...

8 boolean parameters → 256 input combinations. A thorough manual test would cover all 256. In practice, testing frameworks use equivalence partitioning and boundary value analysis to select representative subsets. These are heuristics for approximating full truth table coverage.

The alternative is formal verification: express the function as a boolean formula and use a SAT or SMT solver (Satisfiability Modulo Theories) to automatically check properties over all 256 combinations. This is how hardware verification works. Software verification is catching up.

Tradeoffs

AT9 — Correctness vs. Performance

The truth table gives you correctness. Full truth-table testing guarantees the function is correct for all inputs. The cost is 2ⁿ test cases.

For n=3, correctness is cheap. For n=32, correctness via exhaustion is impractical. You choose between:

Correctness guarantee (enumerate 2ⁿ rows) — exponential cost, no bugs.
Performance (test a subset of rows) — polynomial cost, possible bugs in untested combinations.

No engineering choice eliminates this tension. The tools available are:

Reduce n. Fewer boolean parameters means fewer combinations. Functions with many boolean flags often have implicit parameter coupling that can be expressed as enums. An enum with 4 values is one variable with 4 states — not 4 boolean variables with 16 combinations.
Formal methods. Model checkers and SMT solvers reason over all combinations without enumerating them. They trade compute time for theoretical completeness. For functions with 50 boolean inputs, a SAT solver might determine satisfiability of a property in milliseconds.
Property-based testing. Tools like Hypothesis (Python) generate random inputs from the input domain, then search for failing cases. This is not 2ⁿ coverage, but it explores the space more broadly than manual tests.
Accept partial coverage. For most production software, this is the actual choice. Document which combinations are tested. Design the function to minimize the impact of untested edge cases.

The AT9 tradeoff is always present. Making it explicit forces a deliberate decision about which corners of the truth table are important.

# AT9 illustration: reduce parameter count by replacing booleans with enum
from enum import Enum

class AccessLevel(Enum):
    # Instead of 3 booleans (is_admin, is_editor, is_viewer) = 8 combinations
    # Use one enum = 4 states, clearly ordered
    ADMIN = "admin"
    EDITOR = "editor"
    VIEWER = "viewer"
    NONE = "none"

def can_edit(access: AccessLevel) -> bool:
    # 4 cases instead of 8 combinations — truth table is 4 rows
    return access in (AccessLevel.ADMIN, AccessLevel.EDITOR)

Where It Fails

FM3 — Unbounded Resource Consumption

The failure mode is not in building truth tables. It is in expecting exhaustive truth-table testing to scale.

Consider a function with a boolean gate for each feature flag in a production system. Large systems have dozens of feature flags. 40 feature flags → 2⁴⁰ ≈ 1 trillion combinations. A test suite attempting to cover the full truth table would require terabytes of test data and years of runtime.

The symptom: a CI pipeline that grows slower with every new feature flag. Each flag doubles the number of theoretically-required test cases. Teams respond by testing less — fewer combinations per flag — and that is the correct tradeoff when made explicitly. It is FM3 when the team does not realize they are making it and simply runs out of capacity.

The structural fix is to minimize interaction between feature flags. If two flags are fully independent — one governs the payment flow, the other governs the notification system — they do not interact in the truth table. You can test each independently: 2¹ + 2¹ = 4 tests instead of 2² = 4. For two independent flags this matches the exhaustive count (4 = 2²), but the saving emerges as flags multiply. For 3 independent flags: 3 × 2 = 6 tests instead of 2³ = 8. The saving grows: 40 independent flags require 80 tests instead of ≈10¹².

Independence is the key. When flags interact, the combination space grows. When they are independent, it collapses. Designing feature flags to be independent — to avoid shared mutable state and shared decision points — is a concrete engineering technique to prevent FM3 in test suites.

Real Systems

Hardware Verification — digital circuits are boolean functions. A 64-bit adder has 128 input bits → 2¹²⁸ input combinations. No team builds the full truth table. Instead, hardware design uses formal verification: the circuit is expressed as a boolean formula, and an automated theorem prover checks properties (commutativity, overflow behavior, carry propagation) over all inputs symbolically. The truth table is never enumerated; its properties are proven.

CPU Instruction Decoders — the instruction decoder in a CPU maps binary opcode patterns to control signals. Each opcode bit is a boolean. The decoder’s behavior is fully defined by a truth table (or Karnaugh map). CPU designers use Boolean minimization algorithms (Quine-McCluskey, Espresso) to reduce the truth table to the smallest circuit that produces the correct outputs. Truth tables are inputs to optimization algorithms.

Test Coverage Metrics — code coverage tools (coverage.py, gcov) measure how many branches of the program’s implicit truth table have been exercised. A function with 5 boolean conditions has a theoretical 2⁵ = 32 path combinations. Branch coverage checks each condition individually (2 × 5 = 10 states). Modified Condition/Decision Coverage (MC/DC), required for aviation software (DO-178C), checks that each condition independently affects the outcome — a formal property that demands far fewer test cases — typically n+1 for n conditions.

SQL Query Optimization — the SQL query optimizer treats a WHERE clause as a boolean expression. It applies truth-table-based identities to rewrite the expression: eliminate redundant terms, transform expressions to use available indexes, collapse constant sub-expressions. An expression like WHERE TRUE AND status = 'active' reduces to WHERE status = 'active'. The optimizer is applying boolean simplification — the same transformations that reduce truth table rows.

Feature Flag Systems — platforms like LaunchDarkly evaluate feature flag conditions against user attributes. A flag might be: environment == "production" AND user_in_group("beta") AND random_percentage < 0.1. This is a truth-table row — a conjunction of atomic conditions. Evaluating this for every request is fast (one conjunction). The complexity appears when flags depend on other flags — the combination space grows and interactions become hard to reason about.

Concept: Truth Tables

Core Idea: A truth table is the complete specification of a boolean function — one row per input combination. n variables require 2ⁿ rows. This exponential growth makes exhaustive testing infeasible beyond small n, establishing the central tension between specification completeness and computational cost.

Tradeoff: AT9 — Correctness vs. Performance (full truth-table coverage guarantees correctness; testing subsets is faster but leaves gaps)

Failure Mode: FM3 — Unbounded Resource Consumption (attempting exhaustive truth-table testing scales exponentially; 32 boolean parameters require billions of test cases)

Signal: When a function has many boolean parameters, or when test coverage of a conditional block feels incomplete — enumerate the truth table for small n, use formal methods or property-based testing for large n.

Exercises

Level 1 — Understand

How many rows does a truth table require for each of the following? Do not compute the value — just write the expression. Then compute it.
- 2 variables
- 5 variables
- 10 variables
- n variables
Build the complete truth table for (P OR Q) AND (NOT P OR R). Show all intermediate columns. There are 3 variables — how many rows?
What is Disjunctive Normal Form? Given the truth table for XOR (P ⊕ Q), construct the DNF expression manually, without using the XOR symbol.

Level 2 — Apply

A function grant_access(is_authenticated, is_authorized, is_not_banned) should return True only when all three inputs are True, and False otherwise.

Build the complete truth table. How many rows?
From the truth table, construct the DNF expression.
Simplify the DNF expression. What is the simplest equivalent boolean expression?
A new requirement: “also grant access if the user is a superuser, regardless of ban status.” Introduce a 4th boolean is_superuser. How many rows does the new truth table have? Write the updated function.

Level 3 — Design

A deployment pipeline has 8 boolean gate conditions: tests_passed, lint_clean, security_scan_passed, staging_deployed, smoke_tests_passed, approval_received, change_window_open, no_active_incidents.

How many input combinations exist? Is exhaustive testing feasible in a CI pipeline that runs in under 5 minutes?
Identify which pairs of conditions are likely to be independent (do not interact). Which pairs are likely to interact? Justify your reasoning.
Propose a testing strategy that achieves high confidence without exhaustive coverage. Explicitly name the AT9 tradeoff you are making and quantify the coverage gap (how many combinations are you leaving untested?).
A colleague suggests replacing the 8 booleans with a PipelineState enum. Design the enum values. How many rows does the new “truth table” have? What does this gain and what does it lose? Use AT3 notation.

A complete answer will: walk the reasoning step by step, cite the relevant tradeoff and failure-mode codes, and show the calculation or invariant that supports the conclusion.