💬 Open-Source LLM FrameworksMay 30, 2026✅ Tests passing

LLM Runtime Monitor

This library provides a simple way to monitor resource utilization (CPU, GPU, memory) and performance metrics while running open-source LLMs. It helps developers optimize their runtime environment and debug issues during model execution.

View on GitHub Download ZIP

Share:X / Twitter LinkedIn Reddit Hacker News

What It Does

Track CPU and memory usage in real-time
Log model-specific performance metrics like inference latency
Visualize resource utilization with simple charts

Installation

pip install psutil==5.9.5 matplotlib==3.7.2

Usage

from llm_runtime_monitor import monitor

def dummy_model(duration):
    """A dummy model function to simulate workload."""
    import time
    start = time.time()
    while time.time() - start < duration:
        sum(i * i for i in range(10000))

print("Starting LLM Runtime Monitor with a dummy model...")
monitor(dummy_model, 5)

Source Code

import psutil
import matplotlib.pyplot as plt
import time
import threading
from functools import wraps

def monitor(model_function, *args, **kwargs):
    """
    Monitors resource utilization (CPU, memory) and logs performance metrics
    while running the provided model function.

    Parameters:
        model_function (callable): The function representing the LLM execution.
        *args: Positional arguments to pass to the model function.
        **kwargs: Keyword arguments to pass to the model function.

    Returns:
        The result of the model function execution.
    """
    resource_data = {
        'time': [],
        'cpu': [],
        'memory': []
    }

    def collect_metrics():
        while monitoring["running"]:
            resource_data['time'].append(time.time() - start_time)
            resource_data['cpu'].append(psutil.cpu_percent(interval=None))
            resource_data['memory'].append(psutil.virtual_memory().percent)
            time.sleep(0.5)

    def plot_metrics():
        plt.figure(figsize=(10, 6))
        plt.plot(resource_data['time'], resource_data['cpu'], label='CPU Usage (%)')
        plt.plot(resource_data['time'], resource_data['memory'], label='Memory Usage (%)')
        plt.xlabel('Time (s)')
        plt.ylabel('Usage (%)')
        plt.title('Resource Utilization Over Time')
        plt.legend()
        plt.grid()
        plt.show()

    monitoring = {"running": True}
    start_time = time.time()

    # Start the monitoring thread
    monitor_thread = threading.Thread(target=collect_metrics)
    monitor_thread.start()

    try:
        # Run the model function
        result = model_function(*args, **kwargs)
    finally:
        # Stop monitoring and wait for the thread to finish
        monitoring["running"] = False
        monitor_thread.join()

        # Plot the metrics
        plot_metrics()

    return result

if __name__ == "__main__":
    def dummy_model(duration):
        """A dummy model function to simulate workload."""
        start = time.time()
        while time.time() - start < duration:
            sum(i * i for i in range(10000))

    print("Starting LLM Runtime Monitor with a dummy model...")
    monitor(dummy_model, 5)

Community

Downloads

···

Rate this tool

No ratings yet — be the first!

Details

Tool Name: llm_runtime_monitor
Category: Open-Source LLM Frameworks
Generated: May 30, 2026
Tests: Passing ✅

Quick Install

Clone just this tool:

git clone --depth 1 --filter=blob:none --sparse \
  https://github.com/ptulin/autoaiforge.git
cd autoaiforge
git sparse-checkout set generated_tools/2026-05-30/llm_runtime_monitor
cd generated_tools/2026-05-30/llm_runtime_monitor
pip install -r requirements.txt 2>/dev/null || true
python llm_runtime_monitor.py

Links

View source on GitHub Raw README.md