Streaming RAG for IIoT: using an LLM to perform live analytics for asset performance management

Classic RAG (Retrieval-Augmented Generation) assumes static documents. In IoT, knowledge changes every second. Streaming RAG means:

Ingest continuously (MQTT telemetry stream)
Index incrementally (append-only time-series + optionally semantic embeddings)
Retrieve just-in-time (latest values, last N minutes, anomalies)
Generate answers with streaming tokens (the LLM starts answering immediately and keeps refining its answer as retrieval completes)
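
The ingest, index, and retrieve steps above can be sketched with the standard library alone. This is a minimal illustration, not the code from the repository: the TelemetryStore class, the build_context helper, and the JSON payload shape ({"ts": ..., "value": ...}) are all assumptions made for the example, and it assumes messages arrive in timestamp order.

```python
import json
from bisect import bisect_left
from statistics import mean, pstdev


class TelemetryStore:
    """Hypothetical append-only, in-memory time-series index keyed by topic."""

    def __init__(self):
        self._ts = {}      # topic -> list of timestamps (assumed ascending)
        self._values = {}  # topic -> list of values, aligned with _ts

    def ingest(self, topic, payload):
        """Index one message. Assumed payload: JSON with 'ts' and 'value'."""
        msg = json.loads(payload)  # accepts str or bytes
        self._ts.setdefault(topic, []).append(float(msg["ts"]))
        self._values.setdefault(topic, []).append(float(msg["value"]))

    def latest(self, topic):
        """Return (ts, value) of the most recent sample, or None if empty."""
        ts = self._ts.get(topic)
        if not ts:
            return None
        return ts[-1], self._values[topic][-1]

    def window(self, topic, now, seconds):
        """Return all (ts, value) samples with ts >= now - seconds."""
        ts = self._ts.get(topic, [])
        start = bisect_left(ts, now - seconds)
        return list(zip(ts[start:], self._values.get(topic, [])[start:]))

    def anomalies(self, topic, now, seconds, z=3.0):
        """Return window samples whose z-score magnitude exceeds z."""
        samples = self.window(topic, now, seconds)
        if len(samples) < 2:
            return []
        values = [v for _, v in samples]
        mu, sigma = mean(values), pstdev(values)
        if sigma == 0:
            return []
        return [(t, v) for t, v in samples if abs(v - mu) / sigma > z]


def build_context(store, topic, now, seconds):
    """Assemble just-in-time retrieval results as text for an LLM prompt."""
    lines = [
        f"topic: {topic}",
        f"latest: {store.latest(topic)}",
        f"samples in last {seconds}s: {len(store.window(topic, now, seconds))}",
        f"anomalies: {store.anomalies(topic, now, seconds)}",
    ]
    return "\n".join(lines)
```

In a real deployment, ingest would be called from the MQTT client's message callback, and the text returned by build_context would be placed ahead of the user's question in the prompt. The z-score check is only one simple way to flag anomalies and stands in for whatever detection the actual system uses.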

This gives you a “living assistant” that can answer operational questions in real time and trigger actions (alarms) deterministically.
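
One way to keep alarms deterministic is to decide them with a plain rule outside the LLM and let the model only explain the result. The sketch below assumes a simple high-limit rule; AlarmRule, evaluate, the topic name, and the threshold values are hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class AlarmRule:
    """Hypothetical high-limit rule for one telemetry topic."""
    topic: str
    high_limit: float
    min_consecutive: int = 3  # samples that must exceed the limit in a row


def evaluate(rule, values):
    """Return True if the last min_consecutive values all exceed high_limit.

    The decision depends only on the rule and the data, so the same input
    always yields the same alarm state, regardless of what the LLM says.
    """
    if len(values) < rule.min_consecutive:
        return False
    recent = values[-rule.min_consecutive:]
    return all(v > rule.high_limit for v in recent)


rule = AlarmRule(topic="plant/pump1/temperature", high_limit=80.0)
alarm_active = evaluate(rule, [70.0, 81.0, 82.0, 83.0])
```

Requiring several consecutive samples above the limit is a design choice that avoids raising an alarm on a single noisy reading.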

Clone the code from the repository:

 git clone https://github.com/venergiac/iiot-streaming-rag.git