]> git.ipfire.org Git - thirdparty/rspamd.git/commit
Add GPU and vast.ai support for neural embedding service
authorVsevolod Stakhov <vsevolod@rspamd.com>
Tue, 20 Jan 2026 12:16:36 +0000 (12:16 +0000)
committerVsevolod Stakhov <vsevolod@rspamd.com>
Tue, 20 Jan 2026 12:16:36 +0000 (12:16 +0000)
commitf4cfde49ec22fa4ce4bc9724b9391700e5ec80aa
treea448d8a634be2bcd75339c70eaf9c75ad6d49366
parentd16db9dd67072df1dbb8d14fbdef467d29232e82
Add GPU and vast.ai support for neural embedding service

- Add Dockerfile.gpu for GPU-accelerated inference with PyTorch CUDA
- Add requirements-gpu.txt with pinned versions for CUDA compatibility
- Add vastai-launch.sh script for deploying on vast.ai cloud GPUs
- Update README with GPU deployment instructions and model recommendations

Default GPU model: intfloat/multilingual-e5-large (100+ languages including Russian)
Tested on RTX 4090 with ~20-50ms latency per embedding.
contrib/neural-embedding-service/Dockerfile.gpu [new file with mode: 0644]
contrib/neural-embedding-service/README.md
contrib/neural-embedding-service/docker-compose.yml [new file with mode: 0644]
contrib/neural-embedding-service/requirements-gpu.txt [new file with mode: 0644]
contrib/neural-embedding-service/vastai-launch.sh [new file with mode: 0755]