🎉 Paper accepted at CVPR 2025! 🎉
Yassir Bendou<sup>1</sup>, Amine Ouasfi<sup>2</sup>, Vincent Gripon<sup>1</sup>, Adnane Boukhayma<sup>2</sup>

<sup>1</sup>IMT Atlantique, <sup>2</sup>INRIA
Create a conda environment and install dependencies:
```bash
conda create -n h2b python=3.9
conda activate h2b
pip install -r requirements.txt
```
If you prefer to use uv:
```bash
uv venv --python 3.9
source .venv/bin/activate
uv pip install -r requirements.txt
```
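Optionally, verify the environment with a quick import check (this assumes PyTorch is among the pinned requirements, which CLIP-based code needs):

```bash
python -c "import torch; print(torch.__version__, torch.cuda.is_available())"
```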
Follow DATASET.md to set up ImageNet and the other datasets, using the dataset preparation instructions from CoOp.
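For orientation, a CoOp-style setup typically gathers all datasets under a single root; the authoritative structure is in DATASET.md, and the folder names below are only indicative:

```
$DATA/
├── imagenet/
├── caltech-101/
├── oxford_pets/
└── ...
```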
Running configurations can be modified in the `configs` directory.
For few-shot classification:
```bash
python main.py --method ProKeR --shots 1 2 4 8 16 --dataset caltech101 --augment-epoch 10
```
If GPU memory is saturated, consider reducing the number of data augmentation epochs via `--augment-epoch`, as in the example below.
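For example, the same run with a smaller augmentation budget:

```bash
python main.py --method ProKeR --shots 1 2 4 8 16 --dataset caltech101 --augment-epoch 1
```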
Multiple methods are implemented:
| Name | Details |
| --- | --- |
| ZeroShot | CLIP |
| TIP | Tip-Adapter: Training-free CLIP-Adapter for Better Vision-Language Modeling |
| GDA | A Hard-to-Beat Baseline for Training-free CLIP-based Adaptation |
| CLAP | A Closer Look at the Few-Shot Adaptation of Large Vision-Language Models |
| ProKeR | ProKeR (ours) |
| ProKeR_CLAP_joint | ProKeR (ours) + CLAP |
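Each `Name` is passed via `--method`, as in the command above. For intuition only, here is a minimal NumPy sketch of the training-free kernel view behind ProKeR: kernel ridge regression that corrects the CLIP zero-shot classifier using the cached few-shot features. This is an illustration under assumptions, not the repository's implementation; the Gaussian kernel, `lam`, `gamma`, and all variable names are hypothetical.

```python
# Illustrative sketch only (NOT the repo's exact code).
# Assumed shapes: F (N x d) L2-normalized few-shot features, Y (N x C) one-hot
# labels, W_clip (d x C) zero-shot text classifier, X_test (M x d) test features.
import numpy as np

def gaussian_kernel(A, B, gamma=10.0):
    # k(a, b) = exp(-gamma * ||a - b||^2) on L2-normalized features
    sq = (A**2).sum(1)[:, None] + (B**2).sum(1)[None, :] - 2 * A @ B.T
    return np.exp(-gamma * np.clip(sq, 0.0, None))

def kernel_prior_predict(F, Y, W_clip, X_test, lam=1.0, gamma=10.0):
    # Kernel ridge regression toward the zero-shot prior: the regressor fits
    # the residual Y - F @ W_clip, so with no support data (or lam -> inf)
    # predictions fall back to zero-shot CLIP.
    K = gaussian_kernel(F, F, gamma)                              # (N, N) Gram matrix
    residual = Y - F @ W_clip                                     # what zero-shot gets wrong
    alpha = np.linalg.solve(K + lam * np.eye(len(F)), residual)   # closed form, no training loop
    return X_test @ W_clip + gaussian_kernel(X_test, F, gamma) @ alpha

# Toy usage with random stand-ins for cached CLIP features
rng = np.random.default_rng(0)
d, C, N = 512, 10, 40
F = rng.normal(size=(N, d)); F /= np.linalg.norm(F, axis=1, keepdims=True)
Y = np.eye(C)[rng.integers(0, C, N)]
W_clip = rng.normal(size=(d, C))
X_test = rng.normal(size=(5, d)); X_test /= np.linalg.norm(X_test, axis=1, keepdims=True)
print(kernel_prior_predict(F, Y, W_clip, X_test).shape)           # (5, C) class scores
```

Because the solution has a closed form, no gradient-based training is needed, which is what "training-free" refers to here.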
This repo benefits from Tip-Adapter, CoOp, and GDA.
```bibtex
@article{ProKeR,
  title   = {A Kernel Perspective on Training-Free Few-Shot Adaptation of Large Vision-Language Models},
  author  = {Bendou, Yassir and Ouasfi, Amine and Gripon, Vincent and Boukhayma, Adnane},
  journal = {arXiv preprint arXiv:2501.11175},
  year    = {2025},
  url     = {https://arxiv.org/abs/2501.11175}
}
```
If you have any questions, feel free to contact [email protected].