See it in action

You say

My Ollama server keeps going down overnight and I don't know why. Run OllamaWatch and set up Telegram alerts so I get notified the moment it goes down or recovers.

Your agent does

🔴 OLLAMA DOWN http://localhost:11434 Error: Connection refused (is Ollama running?)

💡 Ollama is not running. Start with: ollama serve (or check the systemd/service).

2026-06-07 21:42:18 UTC

✅ OLLAMA RECOVERED http://localhost:11434 Latency: 142 ms Loaded: llama3.1:8b-instruct-q4_K_M

2026-06-07 21:47:33 UTC

What you get

Monitor AI server health 24/7 without manual log checkingReceive Telegram alerts for CUDA Out-of-Memory (OOM) errorsDetect stalled model downloads automatically in the backgroundCheck VRAM availability before loading large model weightsMonitor a remote Ollama box you cannot see (headless homelab or VPS)Get woken up if an overnight batch job hangs the GPUDetect when a model pull has stalled mid-downloadReplace expensive SaaS monitoring with a $0 setupCatch memory leaks before they take down your inference stack

About this skill

Stop babysitting your Ollama server. Get a Telegram alert the moment a model crashes, the GPU runs out of memory, or the API stops responding.

Built for homelabbers and self-hosters running Ollama 24/7 who don't want to pay for SaaS monitoring or refresh dashboards manually.

Features:

DOWN alerts (🔴): instant notification when the API is unreachable, hung, or returning errors
DEGRADED alerts (⚠️): CUDA OOM, runner crashes, log errors, GPU memory above 95%
RECOVERED alerts (✅): service is back, includes latency and currently loaded models
Smart alerting: fires only on state transitions, no spam
Built-in fix hints: every alert includes a suggested remediation
NVIDIA GPU monitoring via nvidia-smi (gracefully skipped on AMD/Intel/CPU-only setups)
Cross-platform: Linux, macOS, Windows (auto-detects log paths)
Single-file Python script: no Docker, no Node.js, no SaaS subscription
MIT licensed

Install: pip install -r requirements.txt python3 ollamawatch.py check # one-shot health check python3 ollamawatch.py watch # continuous monitoring

Requires: Python 3.8+, Ollama (local or remote), free Telegram bot (5-min setup).

Use cases:

Monitor a remote Ollama box you cannot see (headless homelab, VPS)
Get woken up if an overnight batch job hangs the GPU
Catch memory leaks before they take down your inference stack
Detect when a model pull has stalled mid-download
Replace expensive SaaS monitoring ($0/month vs $30+/month)

Frequently Asked Questions

OllamaWatch — Catch Ollama Crashes Before Your Users Do (Telegram Alerts)

See it in action

What you get

About this skill

How to install

Reviews

No reviews yet

Trust & safety

Creator

Frequently Asked Questions

Popular in AI Agents & LLM Ops

designing-hybrid-context-layers

context-window-tracker

prompt-engineer

codex-grade-coding

OllamaWatch — Catch Ollama Crashes Before Your Users Do (Telegram Alerts)

See it in action

What you get

About this skill

Known limitations

How to install

Reviews

No reviews yet

Trust & safety

Permissions required

Creator

Frequently Asked Questions

What specific problem does OllamaWatch solve for local LLM users?

Which operating systems and hardware setups are compatible with this tool?

How difficult is the installation process for someone who isn't a DevOps expert?

Does this provide more information than a simple 'ping' to the API?

What exactly is included in the purchase of this skill?

Will I be flooded with Telegram notifications if my server stays down for an hour?

Popular in AI Agents & LLM Ops

designing-hybrid-context-layers

context-window-tracker

prompt-engineer

codex-grade-coding