🔧 AI-Generated Content ModerationMay 31, 2026✅ Tests passing

Prompt Safety Checker

Prompt Safety Checker is a Python CLI tool designed to analyze AI model prompts for potentially harmful or inappropriate content before the input is fed to the model. It helps developers ensure that their systems are not generating harmful content due to problematic prompts.

View on GitHub Download ZIP

Share:X / Twitter LinkedIn Reddit Hacker News

What It Does

Analyze prompts for harmful or inappropriate content based on predefined safety rules.
Detect negative sentiment in prompts.
Interactive mode for real-time prompt analysis.

Installation

Python 3.7+
Typer

Usage

Command Line Interface

To check a single prompt:

python prompt_safety_checker.py check-prompt "Your prompt here"

To enable interactive mode:

python prompt_safety_checker.py check-prompt --interactive

Example

$ python prompt_safety_checker.py check-prompt "This contains hate speech."
{'flagged_issues': ['contains hate speech'], 'suggestions': 'Consider rephrasing or removing flagged content.'}

Source Code

import typer
from typing import List

# Initialize the CLI app
app = typer.Typer()

# Default safety rules for prompt analysis
DEFAULT_SAFETY_RULES = [
    "contains hate speech",
    "contains explicit content",
    "incites violence",
    "contains discriminatory language"
]

def analyze_prompt(prompt: str, safety_rules: List[str]) -> dict:
    """
    Analyze the given prompt against safety rules.

    Args:
        prompt (str): The text prompt to analyze.
        safety_rules (list[str]): List of safety rules to check against.

    Returns:
        dict: A dictionary containing flagged issues and suggestions.
    """
    flagged_issues = []

    try:
        # Mocked sentiment analysis result for testing purposes
        sentiment_label = "POSITIVE"  # Default to positive sentiment
        if "hate" in prompt.lower() or "angry" in prompt.lower():
            sentiment_label = "NEGATIVE"

        # Check against safety rules
        for rule in safety_rules:
            if rule in prompt.lower():
                flagged_issues.append(rule)

        # Add sentiment analysis results
        if sentiment_label == "NEGATIVE":
            flagged_issues.append("Negative sentiment detected")

    except Exception as e:
        return {"error": str(e)}

    return {
        "flagged_issues": flagged_issues,
        "suggestions": "Consider rephrasing or removing flagged content."
    }

@app.command()
def check_prompt(prompt: str, interactive: bool = False):
    """
    Check the safety of a given prompt.

    Args:
        prompt (str): The text prompt to analyze.
        interactive (bool): Enable interactive mode for real-time feedback.
    """
    safety_rules = DEFAULT_SAFETY_RULES

    if interactive:
        typer.echo("Interactive mode enabled. Type 'exit' to quit.")
        while True:
            user_prompt = typer.prompt("Enter a prompt")
            if user_prompt.lower() == "exit":
                typer.echo("Exiting interactive mode.")
                break
            result = analyze_prompt(user_prompt, safety_rules)
            typer.echo(result)
    else:
        result = analyze_prompt(prompt, safety_rules)
        typer.echo(result)

if __name__ == "__main__":
    app()

Community

Downloads

···

Rate this tool

No ratings yet — be the first!

Details

Tool Name: prompt_safety_checker
Category: AI-Generated Content Moderation
Generated: May 31, 2026
Tests: Passing ✅
Fix Loops: 2

Quick Install

Clone just this tool:

git clone --depth 1 --filter=blob:none --sparse \
  https://github.com/ptulin/autoaiforge.git
cd autoaiforge
git sparse-checkout set generated_tools/2026-05-31/prompt_safety_checker
cd generated_tools/2026-05-31/prompt_safety_checker
pip install -r requirements.txt 2>/dev/null || true
python prompt_safety_checker.py

Links

View source on GitHub Raw README.md