1

    OllamaWatch — Local LLM Health Monitor + Telegram Alerts

    Monitor your local Ollama server and get Telegram alerts when models crash, the GPU runs out of memory, or downloads stall.

    Updated Jun 2026
    Instant install
    Intel

    $7

    · or 35 credits

    30-day refund guarantee

    Secure checkout via Stripe

    Included in download

    • Monitor AI server health 24/7 without manual log checking
    • Receive Telegram alerts for CUDA Out-of-Memory (OOM) errors
    • terminal, file_read, file_write automation included
    • Ready for Intel
    • Instant install

    Sample input

    My Ollama server keeps going down overnight and I don't know why. Run OllamaWatch and set up Telegram alerts so I get notified the moment it goes down or recovers.

    Sample output

    🔴 OLLAMA DOWN http://localhost:11434 Error: Connection refused (is Ollama running?) 💡 Ollama is not running. Start with: `ollama serve` (or check the systemd/service). 2026-06-07 21:42:18 UTC --- ✅ OLLAMA RECOVERED http://localhost:11434 Latency: 142 ms Loaded: llama3.1:8b-instruct-q4_K_M 2026-06-07 21:47:33 UTC

    About This Skill

    Stop babysitting your Ollama server. Get a Telegram alert the moment a model crashes, the GPU runs out of memory, or the API stops responding. Built for homelabbers and self-hosters running Ollama 24/7 who don't want to pay for SaaS monitoring or refresh dashboards manually. Features: - DOWN alerts (🔴): instant notification when the API is unreachable, hung, or returning errors - DEGRADED alerts (⚠️): CUDA OOM, runner crashes, log errors, GPU memory above 95% - RECOVERED alerts (✅): service is back, includes latency and currently loaded models - Smart alerting: fires only on state transitions, no spam - Built-in fix hints: every alert includes a suggested remediation - NVIDIA GPU monitoring via nvidia-smi (gracefully skipped on AMD/Intel/CPU-only setups) - Cross-platform: Linux, macOS, Windows (auto-detects log paths) - Single-file Python script: no Docker, no Node.js, no SaaS subscription - MIT licensed Install: pip install -r requirements.txt python3 ollamawatch.py check # one-shot health check python3 ollamawatch.py watch # continuous monitoring Requires: Python 3.8+, Ollama (local or remote), free Telegram bot (5-min setup). Use cases: - Monitor a remote Ollama box you cannot see (headless homelab, VPS) - Get woken up if an overnight batch job hangs the GPU - Catch memory leaks before they take down your inference stack - Detect when a model pull has stalled mid-download - Replace expensive SaaS monitoring ($0/month vs $30+/month)

    Reviews

    No reviews yet - be the first to share your experience.

    Only users who have downloaded or purchased this skill can leave a review.

    Security Scanned

    Passed automated security review

    Permissions

    Terminal / Shell
    Read Files
    Write Files
    Network Access
    Environment Variables

    Allowed Hosts

    api.telegram.org
    localhost

    File Scopes

    ollamawatch/**

    Terminal: required to run nvidia-smi for GPU memory monitoring (read-only, fixed command, no shell expansion). Read/Write Files: config.json, .env, state file, log tailing. Network: outbound to Ollama API (default localhost:11434) and api.telegram.org for alerts only. Environment Variables: reads TELEGRAM_BOT_TOKEN and TELEGRAM_CHAT_ID from .env — these two variables only.

    Requires Python 3.8+ and an Ollama instance (local or remote). NVIDIA GPU monitoring is optional and degrades gracefully on AMD/Intel/CPU-only setups. Cross-platform: Linux, macOS, Windows.

    Frequently Asked Questions

    $7