Abstract: Post-training quantization (PTQ) has been widely studied in recent years because it requires neither retraining the network nor access to the entire training dataset. However, naively applying the PTQ ...
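To make the PTQ idea concrete, here is a minimal sketch of per-tensor uniform symmetric quantization, the simplest calibration-free scheme; the function names and the int8 setting are illustrative assumptions, not the method of the paper above:

```python
import numpy as np

def quantize_per_tensor(w: np.ndarray, n_bits: int = 8):
    """Uniform symmetric post-training quantization of a weight tensor.

    No retraining is involved: the scale is derived directly from the
    tensor's own value range (a per-tensor, calibration-free scheme).
    """
    qmax = 2 ** (n_bits - 1) - 1            # e.g. 127 for int8
    scale = np.abs(w).max() / qmax          # map the largest weight to qmax
    q = np.clip(np.round(w / scale), -qmax - 1, qmax).astype(np.int8)
    return q, scale

def dequantize(q: np.ndarray, scale: float) -> np.ndarray:
    return q.astype(np.float32) * scale

# Example: quantize a random weight matrix and measure the rounding error.
w = np.random.randn(4, 4).astype(np.float32)
q, s = quantize_per_tensor(w)
print(np.abs(w - dequantize(q, s)).max())
```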
Abstract: Quantization is a common method to improve communication efficiency in federated learning (FL) by compressing the gradients that clients upload. Currently, most application scenarios involve ...
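As a hedged sketch of how gradient quantization saves upload bandwidth in FL, the following implements QSGD-style stochastic rounding, one common unbiased scheme (chosen here for illustration; the abstract above does not specify which method it uses):

```python
import numpy as np

def qsgd_quantize(g: np.ndarray, levels: int = 256):
    """QSGD-style stochastic quantization of a client's gradient vector.

    Each coordinate is rounded up or down to a quantization level with a
    probability equal to its fractional part, which keeps the estimate
    unbiased, so the server can average dequantized uploads safely.
    """
    norm = np.linalg.norm(g)
    if norm == 0:
        return np.zeros_like(g, dtype=np.int32), 0.0
    scaled = np.abs(g) / norm * (levels - 1)
    lower = np.floor(scaled)
    q = lower + (np.random.rand(*g.shape) < (scaled - lower))
    return (np.sign(g) * q).astype(np.int32), norm

def qsgd_dequantize(q: np.ndarray, norm: float, levels: int = 256) -> np.ndarray:
    return q.astype(np.float32) * norm / (levels - 1)
```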
Visual instruction tuning towards large language and vision models with GPT-4 level capabilities. Check out the details with the load_pretrained_model function in ...
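A typical call to load_pretrained_model looks like the sketch below, based on the LLaVA repository's builder module; the exact arguments and the model checkpoint name are assumptions that may differ across versions:

```python
from llava.model.builder import load_pretrained_model
from llava.mm_utils import get_model_name_from_path

model_path = "liuhaotian/llava-v1.5-7b"  # illustrative checkpoint
tokenizer, model, image_processor, context_len = load_pretrained_model(
    model_path=model_path,
    model_base=None,  # set when loading LoRA/delta weights on a base model
    model_name=get_model_name_from_path(model_path),
)
```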
The model is pre-trained on 25T tokens using a Warmup Stable Decay learning rate schedule with a batch size of 3072, a peak learning rate of 1e-3 and a minimum learning rate of 1e-5. The NVFP4 ...
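The Warmup Stable Decay (WSD) schedule can be sketched as below, using the peak and minimum learning rates quoted above; the warmup and decay fractions are illustrative assumptions, since the snippet does not state them:

```python
def wsd_lr(step: int, total_steps: int,
           warmup_frac: float = 0.01, decay_frac: float = 0.1,
           peak_lr: float = 1e-3, min_lr: float = 1e-5) -> float:
    """Warmup-Stable-Decay: linear warmup to peak_lr, a long constant
    plateau, then a decay to min_lr over the final fraction of training."""
    warmup_steps = int(total_steps * warmup_frac)
    decay_start = int(total_steps * (1.0 - decay_frac))
    if step < warmup_steps:
        return peak_lr * step / max(warmup_steps, 1)  # linear warmup
    if step < decay_start:
        return peak_lr                                # stable plateau
    # Linear decay from peak_lr down to min_lr over the remaining steps.
    t = (step - decay_start) / max(total_steps - decay_start, 1)
    return peak_lr + (min_lr - peak_lr) * t
```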
conda create -n llava python=3.10 -y
conda activate llava
pip install --upgrade pip  # enable PEP 660 support
pip install -e .
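The pip upgrade is needed because PEP 660 editable installs from pyproject.toml-only projects require a recent pip. A quick sanity check after installing, assuming the repository's package is importable as llava:

```python
# Verify the editable install succeeded (assumes the package name `llava`).
import llava
print(llava.__file__)
```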