streaming-inference

Here are 12 public repositories matching this topic...

paul-krug / pytorch-tcn

(Realtime) Temporal Convolutions in PyTorch

realtime causal-inference causal causal-models temporal-convolutional-networks streaming-inference realtime-neural-network

Updated Apr 7, 2025
Python

ictnlp / SLED-TTS

Star

Streamable Text-to-Speech model using a language modeling approach, without vector quantization

text-to-speech speech-synthesis streaming-inference speech-language-model

Updated May 20, 2025
Python

zhangzijie-pro / Speaker-Verification

Star

Dual-model speech AI toolkit for speaker verification and speaker-aware diarization, with streaming inference, meeting analysis, long-audio monitoring, and speaker-bank integration.

pytorch speaker-recognition speaker-verification speaker-diarization voice-ai open-set-identification audio-ml streaming-inference meeting-analysis

Updated Apr 23, 2026
Python

pszemraj / megalodon-hf

Star

Pure PyTorch + 🤗 Transformers reimplementation of Megalodon (CEMA + chunked attention) - readable, hackable, no CUDA kernels required

pytorch rope ema pytorch-implementation linear-attention efficient-transformers llm sub-quadratic-attention long-context-modeling streaming-inference complex-ema

Updated Jan 12, 2026
Python

eulogik / NanoForecast

Star

World's most deployable time series foundation model — 200K-6.5M params, zero-shot forecasting, streaming RNN inference, ONNX edge deployment, runs on Raspberry Pi

raspberry-pi machine-learning deep-learning time-series pytorch transformer forecasting deployable onnx time-series-forecasting edge-ai huggingface foundation-model streaming-inference

Updated Jun 26, 2026
Python

OmprakashSahani / atlas-ai

Star

Open ML systems platform for training, profiling, evaluating, and serving AI models.

infrastructure distributed-systems performance-engineering benchmarking transformers autograd profiling observability distributed-training research-engineering inference-optimization fastapi kv-cache ml-systems streaming-inference transformer-systems

Updated May 14, 2026
Python

wpferrell / Bigsmall

Star

Lossless AI model compression - ~34% smaller with bit-identical weights; the autopilot profiles your machine, picks the highest fidelity that runs, and streams models bigger than your RAM.

pytorch autopilot entropy-coding model-compression huggingface lossless-compression kv-cache bf16 llm fp8 safetensors streaming-inference

Updated Jun 13, 2026
Python

esl-epfl / streaminnc

Star

Don't Think It Twice: Exploit Shift Invariance for Efficient Online Streaming Inference of CNNs

time-series convolutional-neural-networks biosignals efficient-inference edge-ai shift-invariant model-optimization online-inference embedded-ml streaming-inference

Updated Aug 6, 2024
Python

ollycassidy13 / CascadeLUT

Star

CascadeLUT: Information-Ordered Streaming Inference for Bandwidth-Constrained FPGAs [FPL'26]

fpga neural-network fpl streaming-inference bandwidth-limitation

Updated Aug 20, 2025
Python

bracoTuxbr / local-coherence

Star

CPU-native inference runtime. Local-propagation paradigm: the active region pays the cost, not the field. Bit-exact across architectures. Validated for streaming anomaly detection and audio VAD.

Updated Jun 22, 2026
C++

sholokhovalexey / S4ND-U-Net_speech_enhancement

Star

Streaming version of S4ND-U-Net

speech-enhancement streaming-inference structured-state-space-sequence-model

Updated Nov 4, 2025
Python

27-ganesh-07 / realtime-music-genre

Star

Real-time music-genre classification: spectrogram CNN, ONNX-optimised, served as a streaming/chunked classifier with PyTorch-vs-ONNX benchmarks. Track-aware GTZAN eval.

audio python cnn music-genre-classification onnx torchaudio mlops gtzan onnxruntime streaming-inference

Updated Jun 25, 2026
Python

Improve this page

Add a description, image, and links to the streaming-inference topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the streaming-inference topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

streaming-inference

Here are 12 public repositories matching this topic...

paul-krug / pytorch-tcn

ictnlp / SLED-TTS

zhangzijie-pro / Speaker-Verification

pszemraj / megalodon-hf

eulogik / NanoForecast

OmprakashSahani / atlas-ai

wpferrell / Bigsmall

esl-epfl / streaminnc

ollycassidy13 / CascadeLUT

bracoTuxbr / local-coherence

sholokhovalexey / S4ND-U-Net_speech_enhancement

27-ganesh-07 / realtime-music-genre

Improve this page

Add this topic to your repo