Probabilistic Programming Inference

Multi-token prediction technique triples LLM inference speed without auxiliary draft models

With reported 3x speed gains and limited degradation in output quality, the method targets one of the biggest pain points in production AI systems: latency at scale.

Researchers baked 3x inference speedups directly into LLM weights — without speculative decoding

Researchers from the University of Maryland, Lawrence Livermore, Columbia and TogetherAI have developed a training technique that triples LLM inference speed without auxiliary models or infrastructure ...

GitHub

SPIDER: Scalable Probabilistic Inference for Differential Earthquake Relocation

SPIDER is a Python toolkit for probabilistic earthquake relocation using differential travel times, neural travel‑time prediction, and scalable MCMC sampling. It combines a fast surrogate travel‑time ...

Scientific Research Publishing

Fages, F. and Soliman, S. (2008) Model Revision from Temporal Logic Properties in Computational Systems Biology. In: De Raedt, L., Frasconi, P., Kersting, K. and Muggleton, S ...

ABSTRACT: This paper introduces a methodology that enables the relational learning framework to incorporate quantitative data derived from experimental studies in microbial ecology. The focus of using ...

eLife

Virtual Brain Inference (VBI), a flexible and integrative toolkit for efficient probabilistic inference on whole-brain models

This paper presents a valuable software package, named "Virtual Brain Inference" (VBI), that enables faster and more efficient inference of parameters in dynamical system models of whole-brain ...

eLife

Virtual Brain Inference (VBI): A flexible and integrative toolkit for efficient probabilistic inference on virtual brain models

Not revised: This Reviewed Preprint includes the authors’ original preprint (without revision), an eLife assessment, public reviews, and a provisional response from the authors. This work provides a ...

Microsoft

TerpreT: A Probabilistic Programming Language for Program Induction

We study machine learning formulations of inductive program synthesis; given input-output examples, we try to synthesize source code that maps inputs to corresponding outputs. Our aims are to develop ...

Microsoft

Probabilistic Programming with Infer.NET

Probabilistic Programming is a way of defining probabilistic models by overloading the operations in a standard programming language to have probabilistic meanings. The goal is to specify probabilistic ...
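
The idea of overloading ordinary operations to carry probabilistic meaning can be illustrated with a minimal Python sketch. This is not Infer.NET's API (Infer.NET is a .NET library); the names RandomVariable, lift, gaussian and estimate_mean are hypothetical and exist only for this illustration. The sketch only draws forward samples and performs no inference.

```python
import random


class RandomVariable:
    """Hypothetical wrapper: a quantity defined by how to sample it."""

    def __init__(self, sampler):
        # sampler: function taking a random.Random and returning one draw
        self.sampler = sampler

    def sample(self, rng):
        return self.sampler(rng)

    @staticmethod
    def lift(value):
        # Treat plain numbers as constant random variables.
        if isinstance(value, RandomVariable):
            return value
        return RandomVariable(lambda rng: value)

    def __add__(self, other):
        # "+" now builds a new random variable instead of adding numbers.
        other = RandomVariable.lift(other)
        return RandomVariable(lambda rng: self.sample(rng) + other.sample(rng))

    __radd__ = __add__

    def __mul__(self, other):
        # "*" is overloaded in the same way.
        other = RandomVariable.lift(other)
        return RandomVariable(lambda rng: self.sample(rng) * other.sample(rng))

    __rmul__ = __mul__


def gaussian(mean, std):
    # Assumed helper: a normally distributed random variable.
    return RandomVariable(lambda rng: rng.gauss(mean, std))


def estimate_mean(variable, draws=20000, seed=0):
    # Monte Carlo estimate of the expected value, with a fixed seed.
    rng = random.Random(seed)
    return sum(variable.sample(rng) for _ in range(draws)) / draws


x = gaussian(1.0, 0.5)
y = gaussian(2.0, 0.5)
z = 2 * x + y  # reads like arithmetic, but defines a distribution with mean 4

print(estimate_mean(z))
```

In this sketch each operand is re-sampled on every draw, so x + x would not behave like 2 * x; a real probabilistic programming system tracks variable identity and also conditions on observed data, which this example leaves out.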

ZDNet

Cloud-native computing is poised to explode, thanks to AI inference work

The CNCF is bullish about cloud-native computing working hand in glove with AI. AI inference is the technology that will make hundreds of billions for cloud-native companies. New kinds of AI-first ...

Nature

Probabilistic Programming Languages and Inference

Probabilistic programming languages (PPLs) have emerged as a transformative tool for expressing complex statistical models and automating inference procedures. By integrating probability theory into ...

IEEE

Improving Post-Training Quantization via Probabilistic Programming

Abstract: Post-training quantization (PTQ) is an effective solution for deploying deep neural networks on edge devices with limited resources. PTQ is especially attractive because it does not require ...

CNBC

Nvidia's inference growth engine

CNBC’s Deirdre Bosa joins 'Money Movers' to discuss AI usage surge fueling Nvidia. Got a confidential news tip? We want to hear from you. Sign up for free newsletters and get more CNBC delivered to ...
