Local LLM inference engine supporting GGUF models with hardware acceleration on Metal, CUDA, ANE, and WebGPU. Features Flash Attention, MicroLoRA, RoPE, quantization (Q4-Q8, π-Quantization), MoE routing, and streaming token output for browser and edge deployment.
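
To illustrate the idea behind the Q4-Q8 quantization the engine supports, here is a minimal sketch of block-wise symmetric 4-bit quantization. This is an assumption-laden illustration of the general technique, not the actual GGUF Q4 block layout (which packs scales and nibbles into a specific binary format); the function names are hypothetical.

```python
# Illustrative block-wise symmetric 4-bit quantization (not the real
# GGUF Q4 format): each block of weights shares one float scale, and
# each weight is stored as a signed 4-bit integer in [-8, 7].

def quantize_q4(values, block_size=32):
    """Quantize floats to 4-bit ints with one scale per block."""
    blocks = []
    for i in range(0, len(values), block_size):
        block = values[i:i + block_size]
        amax = max(abs(v) for v in block) or 1.0
        scale = amax / 7.0  # map the largest magnitude to +/-7
        q = [max(-8, min(7, round(v / scale))) for v in block]
        blocks.append((scale, q))
    return blocks

def dequantize_q4(blocks):
    """Reconstruct approximate floats from (scale, ints) blocks."""
    out = []
    for scale, q in blocks:
        out.extend(x * scale for x in q)
    return out
```

Because each block stores one scale plus 4 bits per weight, this roughly quarters memory versus fp16 at the cost of per-block rounding error bounded by half the scale.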