🚀 Parallel-Probe is a training-free controller for efficient parallel reasoning in large language models. Using 2D Probing, we reveal global width–depth dynamics of parallel trajectories, uncovering ...
This repository contains the official PyTorch implementation for the ECCV2024 paper "AdaLog: Post-Training Quantization for Vision Transformers with Adaptive Logarithm Quantizer". AdaLog adapts the ...