Transformer LLM - Search News

New LLM optimization technique slashes memory costs up to 75%

Researchers at the Tokyo-based startup Sakana AI have developed a new technique that enables language models to use memory more efficiently, helping enterprises cut the costs of building applications ...

1don MSN

Tiny startup Arcee AI built a 400B open source LLM from scratch to best Meta’s Llama

30-person startup Arcee AI has released a 400B model called Trinity, which it says is one of the biggest open source foundation models from a US company.

InfoQ

Meta Open-Sources Byte Latent Transformer LLM with Improved Scalability

Meta open-sourced Byte Latent Transformer (BLT), an LLM architecture that uses a learned dynamic scheme for processing patches of bytes instead of a tokenizer. This allows BLT models to match the ...

SDxCentral

DeepSeek looks to offload simple LLM tasks to save billions of parameters

Detailed in a recently published technical paper, the Chinese startup’s Engram concept offloads static knowledge (simple ...

11d

DeepSeek’s Engram Conditional Memory Shows How to Reduce AI Compute Waste

DeepSeek's new Engram AI model separates recall from reasoning with hash-based memory in RAM, easing GPU pressure so teams ...

News Medical

Insilico and NVIDIA unveil new LLM transformer for solving biological and chemical tasks

In a new paper, researchers from clinical stage artificial intelligence (AI)-driven drug discovery company Insilico Medicine ("Insilico"), in collaboration with NVIDIA, present a new large language ...

WinBuzzer

AI: Memory Bottleneck Emerges as Main LLM Inference Challenge

Google researchers have revealed that memory and interconnect are the primary bottlenecks for LLM inference, not compute power, as memory bandwidth lags 4.7x behind.

Mongabay

Founding AI/ML Engineering Lead (LLM Specialist)

Full-time About Mongabay Mongabay is a leading environmental news platform that reaches over 60 million people annually with trusted journalism about conservation, climate change, and environmental ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results