Nvidia's KV Cache Transform Coding (KVTC) compresses LLM key-value cache by 20x without model changes, cutting GPU memory costs and time-to-first-token by up to 8x for multi-turn AI applications.
Large language models by themselves are less than meets the eye; the moniker “stochastic parrots” isn’t wrong. Connect LLMs to specific data for retrieval-augmented generation (RAG) and you get a more ...
LangChain is a modular framework for Python and JavaScript that simplifies the development of applications powered by generative AI language models. Using large language models (LLMs) is ...
Powered by Gensonix AI DB, Scientel's LLM solution supports multiple DB nodes in a single LLM application. Our ...
Today, VectorShift, a startup working to simplify large language model (LLM) application development with a modular no-code approach, announced it has raised $3 million in seed funding from 1984 ...
NEW YORK, June 26, 2024 /PRNewswire/ -- Datadog, Inc. (NASDAQ: DDOG), the monitoring and security platform for cloud applications, today announced the general availability of LLM Observability, which ...
SHANGHAI--(BUSINESS WIRE)--Ant Group today unveiled its financial large language model (“the financial LLM”) at the 2023 INCLUSION·Conference on the Bund, alongside two new applications powered by the ...
Have you ever wondered why off-the-shelf large language models (LLMs) sometimes fall short of delivering the precision or context you need for your specific application? Whether you’re working in a ...