LIVE TUE, 30 JUN, 2026 BENGALURU · 28°C EDITION № 61 · FREE · NO LOGIN

Gpus

2 stories tagged Gpus · latest first

AI

AI · 2 min
Real-time LLM Inference on Standard GPUs: 3k tokens/s per request

Kog AI has launched a tech preview of its Kog Inference Engine (KIE), achieving real-time large language model (LLM) inference speeds of 3,000 output token…
29 May, 10:01 pm IST
INDIA

INDIA · 2 min
Rising AI adoption could drive adoption of 650k GPUs in India's data centres: Avendus Capital

Rising adoption of artificial intelligence (AI) in India is expected to drive demand for approximately 650,000 graphics processing units (GPUs) in the coun…
28 May, 12:11 am IST

▸ WIRE

Premium content free for first 12 months · sign up to unlock Razorpay subscriptions launch Jan 2027 — ₹199/mo or ₹999/yr Every story reads every Indian tech source so you don't have to Every article cited · trust the source, not just the byline India's startup desk, edited daily Founders · Funding · Policy · Tech — three crawls a day Premium content free for first 12 months · sign up to unlock Razorpay subscriptions launch Jan 2027 — ₹199/mo or ₹999/yr Every story reads every Indian tech source so you don't have to Every article cited · trust the source, not just the byline India's startup desk, edited daily Founders · Funding · Policy · Tech — three crawls a day

Real-time LLM Inference on Standard GPUs: 3k tokens/s per request

Rising AI adoption could drive adoption of 650k GPUs in India's data centres: Avendus Capital