Loop, Recursion and Memory Tracing

This page describes two additional PythonTA features:

Print-based debugging for loops and recursion: This feature makes tracing easier by printing the state of each loop iteration or recursive function call in a nicely-formatted table using the tabulate library.
Snapshot-based debugging for visualizing the memory diagram: This feature makes it easier to visualize the Python memory model by leveraging the MemoryViz library.

These additional features are found in the python.debug submodule.

Loop tracing with `AccumulationTable`

The following section will focus on the debugging of while and for loops. This feature uses the python_ta.debug.AccumulationTable as a context manager wrapping a loop. Let’s start with two examples.

Example 1: for loop

# demo.py
from python_ta.debug import AccumulationTable


def calculate_sum_and_averages(numbers: list) -> list:
    """Return the running sums and averages of the given numbers.
    """
    sum_so_far = 0
    list_so_far = []
    avg_so_far = None
    with AccumulationTable(["sum_so_far", "avg_so_far", "list_so_far"]):
        for number in numbers:
            sum_so_far = sum_so_far + number
            avg_so_far = sum_so_far / (len(list_so_far) + 1)
            list_so_far.append((sum_so_far, avg_so_far))

    return list_so_far


if __name__ == '__main__':
    calculate_sum_and_averages([10, 20, 30, 40, 50, 60])

When this file is run, we get the following output:

$ python demo.py
iteration    number    sum_so_far    avg_so_far    list_so_far
-----------  --------  ------------  ------------  ---------------------------------------------------------------------------
          N/A       0             None          []
          10        10            10.0          [(10, 10.0)]
          20        30            15.0          [(10, 10.0), (30, 15.0)]
          30        60            20.0          [(10, 10.0), (30, 15.0), (60, 20.0)]
          40        100           25.0          [(10, 10.0), (30, 15.0), (60, 20.0), (100, 25.0)]
          50        150           30.0          [(10, 10.0), (30, 15.0), (60, 20.0), (100, 25.0), (150, 30.0)]
          60        210           35.0          [(10, 10.0), (30, 15.0), (60, 20.0), (100, 25.0), (150, 30.0), (210, 35.0)]

Example 2: while loop

To use AccumulationTable with while loops, you need to pass in the name of the loop variable when initializing the table.

# demo.py
from python_ta.debug import AccumulationTable


def calculate_sum_up_to_target(target: int) -> list:
    """Return the running sums of the given target.
    """
    number = 0
    sum_so_far = 0
    list_so_far = []
    with AccumulationTable(["number", "sum_so_far", "list_so_far"]):
        while number <= target:
            sum_so_far = sum_so_far + number
            list_so_far = list_so_far + [number]
            number += 1

    return list_so_far


if __name__ == '__main__':
    calculate_sum_up_to_target(5)

When this file is run, we get the following output:

$ python demo.py
iteration    number    sum_so_far    list_so_far
-----------  --------  ------------  ------------------
          0         0             []
          1         0             [0]
          2         1             [0, 1]
          3         3             [0, 1, 2]
          4         6             [0, 1, 2, 3]
          5         10            [0, 1, 2, 3, 4]
          6         15            [0, 1, 2, 3, 4, 5]

Example 3: Multiple sequential loops

The AccumulationTable context manager can track multiple sequential loops within the same with block. Each loop will generate its own separate table in the output.

# demo.py
from python_ta.debug import AccumulationTable


def process_with_multiple_loops(numbers: list) -> int:
    """Process numbers through multiple sequential loops.
    """
    sum_so_far = 0
    i = 0
    with AccumulationTable(["sum_so_far", "i"]):
        for number in numbers:
            sum_so_far = sum_so_far + number
        for j in range(len(numbers)):
            sum_so_far += 1
        while i < 5:
            i += 1
            sum_so_far -= i

    return sum_so_far


if __name__ == '__main__':
    result = process_with_multiple_loops([10, 20, 30])

When this file is run, we get the following output showing three separate tables (one for each loop):

$ python demo.py
iteration    number    sum_so_far    i
-----------  --------  ------------  ---
0            N/A       0             0
1            10        10            0
2            20        30            0
3            30        60            0

iteration    j    sum_so_far    i
-----------  ---  ------------  ---
0            N/A  60            0
1            0    61            0
2            1    62            0
3            2    63            0

iteration    sum_so_far    i
-----------  ------------  ---
0            63            0
1            62            1
2            60            2
3            57            3
4            53            4
5            48            5

Example 4: Different accumulators per loop

You can specify different accumulators for each loop by passing a list of lists, where each inner list corresponds to the accumulators for that loop (in order):

# demo.py
from python_ta.debug import AccumulationTable


def process_different_accumulators(numbers: list) -> tuple:
    """Process numbers with different accumulators per loop."""
    sum_so_far = 0
    count_so_far = 0
    product_so_far = 1
    unused_var = 10

    with AccumulationTable([["sum_so_far", "count_so_far"], ["product_so_far"]]):
        for x in numbers:
            sum_so_far += x
            count_so_far += 1
            unused_var += 1
        for y in numbers:
            product_so_far *= y
            unused_var -= 1

    return sum_so_far, product_so_far


if __name__ == '__main__':
    result = process_different_accumulators([2, 3, 4])

When this file is run, the first loop tracks sum_so_far and count_so_far, and the second loop tracks only product_so_far:

$ python demo.py
iteration    x    sum_so_far    count_so_far
-----------  ---  ------------  --------------
0            N/A  0             0
1            2    2             1
2            3    5             2
3            4    9             3

iteration    y    product_so_far
-----------  ---  ----------------
0            N/A  1
1            2    2
2            3    6
3            4    24

API

AccumulationTable.__init__(accumulation_names: list[str] | list[list[str]], output: None | str = None, format: Literal['table', 'csv'] = 'table') → None

Initialize an AccumulationTable context manager for print-based loop debugging.

Parameters:

accumulation_names – either a list of accumulator variable names to track across all loops, or a list of lists where each inner list contains the accumulator names for the corresponding loop.
output – optional filepath where the table will be written. If None, prints to stdout.
format – output format, either “table” (formatted table) or “csv” (CSV format).

The AccumulationTable class provides a loops instance attribute you can access after the with statement:

AccumulationTable.loops: list[dict[str, Any]]: A list of dictionaries containing loop-specific data for each loop in the with block.

This is a list of dictionaries, where each dictionary contains data for one loop:

table.loops[i]["loop_variables"]: A dictionary mapping loop variable names to their values during each iteration
table.loops[i]["loop_accumulators"]: A dictionary mapping accumulator names to their values during each iteration
table.loops[i]["loop_lineno"]: The line number of the loop in the source code

For convenience, when tracking a single loop, you can use these read-only properties instead:

AccumulationTable.loop_variables: Read-only access to loop variable values. Only works with a single loop in context manager.

AccumulationTable.loop_accumulators: Read-only access to accumulator values. Only works with a single loop in context manager.

For example, to access data from a single loop:

from python_ta.debug import AccumulationTable


def calculate_sum_and_averages(numbers: list) -> list:
    """Return the running sums and averages of the given numbers.
    """
    sum_so_far = 0
    list_so_far = []
    avg_so_far = None
    with AccumulationTable(["sum_so_far", "avg_so_far", "list_so_far"]) as table:
        for number in numbers:
            sum_so_far = sum_so_far + number
            avg_so_far = sum_so_far / (len(list_so_far) + 1)
            list_so_far.append((sum_so_far, avg_so_far))

    # Access the first (and only) loop's data
    print(table.loop_variables)
    print(table.loop_accumulators)
    return list_so_far

For multiple sequential loops, you can access each loop’s data separately:

from python_ta.debug import AccumulationTable


def process_with_multiple_loops(numbers: list) -> int:
    """Process numbers through multiple sequential loops.
    """
    sum_so_far = 0
    i = 0
    with AccumulationTable(["sum_so_far", "i"]) as table:
        for number in numbers:
            sum_so_far = sum_so_far + number
        for j in range(len(numbers)):
            sum_so_far += 1
        while i < 5:
            i += 1
            sum_so_far -= i

    # Access data from each loop
    print("First loop:", table.loops[0]["loop_accumulators"])
    print("Second loop:", table.loops[1]["loop_accumulators"])
    print("Third loop:", table.loops[2]["loop_variables"])

    return sum_so_far

You also have the option to pass in a file path as an attribute to the AccumulationTable object. In this case, the table will be appended to the file instead of being written the console.

Finally, you can also specify the output format as a third attribute using the format argument. By default, the format is “table”, which produces the same nicely formatted tabular output shown above. If you want to write the output as a .csv file, pass “csv” as the format.

For example, the following code writes a csv formattted output to output.txt.

from python_ta.debug import AccumulationTable


def calculate_sum_and_averages(numbers: list) -> list:
    """Return the running sums and averages of the given numbers.
    """
    sum_so_far = 0
    list_so_far = []
    avg_so_far = None
    output_file = 'output.txt'
    with AccumulationTable(["sum_so_far", "avg_so_far", "list_so_far"], output=output_file, format='csv') as table:
        for number in numbers:
            sum_so_far = sum_so_far + number
            avg_so_far = sum_so_far / (len(list_so_far) + 1)
            list_so_far.append((sum_so_far, avg_so_far))

    return list_so_far

Current limitations

The AccumulationTable is a new PythonTA feature and currently has the following known limitations:

AccumulationTable uses sys.settrace to update variable state, and so is not compatible with other libraries (e.g. debuggers, code coverage tools).
Nested loops are not tracked. The AccumulationTable is designed to work with sequential loops (loops that run one after another).

Recursion tracing with `RecursionTable`

This section will discuss the debugging of recursive functions. This feature uses the python_ta.debug.RecursionTable class as a context manager wrapping a recursive function.

Example usage

# demo.py
from python_ta.debug import RecursionTable

def factorial(n: int) -> int:
    """Calculate the factorial of n."""
    if n == 0:
        return 1
    return n * factorial(n - 1)

def trace_factorial(number: int) -> None:
    "Trace a recursively defined factorial function using RecursionTable."
    with RecursionTable("factorial"):
        factorial(number)

if __name__ == '__main__':
    trace_factorial(4)

When this file is run, we get the following output:

$ python demo.py
n    return value    called by
---  --------------  ------------
4    24              N/A
3    6               factorial(4)
2    2               factorial(3)
1    1               factorial(2)
0    1               factorial(1)

API

RecursionTable.__init__(function_name: str) → None: Initialize a RecursionTable context manager for print-based recursive debugging of <function_name>.

The RecursionTable class has the following methods you can access after the with statement.

RecursionTable.get_recursive_dict() → dict[str, list]: Use the instance variables that define the table to create a final dictionary which directly represents the table.

For example:

# demo.py
from python_ta.debug import RecursionTable

def factorial(n: int) -> int:
    """Calculate the factorial of n."""
    if n == 0:
        return 1
    return n * factorial(n - 1)

def trace_factorial(number: int) -> None:
    "Trace a recursively defined factorial function using RecursionTable."
    with RecursionTable("factorial") as table:
        factorial(number)

    traced_data = table.get_recursive_dict()
    print(traced_data)

Tracing with user-defined classes

Both AccumulationTable and RecursionTable call str on objects to display table entries. If you plan to use instances of a user-defined class in these tables (for example, a Tree class when tracing a recursive Tree method), we recommend implementing the __str__ method in your class to ensure a meaningful output is displayed.

Current limitations

The RecursionTable is a new PythonTA feature and currently has the following known limitations:

RecursionTable uses sys.settrace to update variable state, and so is not compatible with other libraries (e.g. debuggers, code coverage tools).
Only one function can be traced per use of RecursionTable, and so mutually-recursive functions are not supported.

Tracing the Python Memory Model

The following section will focus on tracing the Python memory model. This feature uses the python_ta.debug.SnapshotTracer as a context manager to visualize program memory.

Example usage

# demo.py
from python_ta.debug import SnapshotTracer


def func_multi_line(output_directory: str = None) -> None:
    """
    Function for testing SnapshotTracer
    """
    with SnapshotTracer(
        output_directory=output_directory,
        include_frames=("func_multi_line",),
        exclude_vars=("output_directory",),
        memory_viz_args=MEMORY_VIZ_ARGS,
    ):
        num = 123
        some_string = "Hello, world"
        num2 = 321
        arr = [some_string, "string 123321"]


if __name__ == '__main__':
    func_multi_line()

When this function runs, the variables within func_multi_line are captured, and memory models are outputted to output_directory for each line of code. For the expected output, refer to the snapshots in tests/test_debug/snapshot_tracer_testing_snapshots/func_multi_line.

API

SnapshotTracer.__init__(output_directory: str | None = None, webstepper: bool = False, **kwargs) → None

Initialize a context manager for snapshot-based debugging.

Parameters:

output_directory – The directory to save the snapshots, defaulting to the current directory. Note: Use this argument instead of the –output flag in memory_viz_args to specify the output directory. The directory will be created if it does not exist.
webstepper – Opens a MemoryViz Webstepper webpage to interactively visualize the resulting memory diagrams.
**kwargs – All other keyword arguments are passed to python.debug.snapshot. Refer to the snapshot function for more details.

Current Limitations

The SnapshotTracer has the following limitations:

Due to differences in Python interpreters, this context manager only works with Python versions >= 3.10.
The context manager does not step into any function calls. Calling functions within the traced function may lead to undefined behavior.
SnapshotTracer uses sys.settrace to update variable states, and therefore is not compatible with other libraries (e.g., debuggers, code coverage tools).

Loop, Recursion and Memory Tracing

Loop tracing with AccumulationTable

Example 1: for loop

Example 2: while loop

Example 3: Multiple sequential loops

Example 4: Different accumulators per loop

API

Current limitations

Recursion tracing with RecursionTable

Example usage

API

Tracing with user-defined classes

Current limitations

Tracing the Python Memory Model

Example usage

API

Current Limitations

Loop tracing with `AccumulationTable`

Recursion tracing with `RecursionTable`