SS26 inovex: Counting the Unknown: The Class-Agnostic Low-Shot Challenge

Counting the Unknown: The Class-Agnostic Low-Shot Challenge

Sponsored by: inovex GmbH
Project lead: Dr. Ricardo Acevedo Cabra
Scientific lead: Dr. Dominik Traxl, Maximilian Vieweg, Kamilla Mohr
TUM co-mentor: TBA
Term: Summer semester 2026
Application deadline: Sunday 25.01.2026

Apply to this project here

The project focuses on developing a class-agnostic, low-shot, multi-label visual object-counting system. Traditional counting methods depend heavily on class-specific datasets and extensive model fine-tuning, limiting their flexibility in real-world applications. In contrast, this project aims to design a system that can accurately count any object class using only a handful of example clicks, text labels or exemplar images.

Motivation

Object counting is essential across diverse domains such as inventory management, quality control, environmental monitoring, and public safety. A generalized solution that works across object categories without retraining would dramatically improve scalability, adaptability, and practical usability in these fields.

Goals
● Primary Goal: Build a robust, high-accuracy system for class-agnostic, low-shot, multi-label object counting.
● True Multi-Label Capability: Count multiple distinct object classes in an image using only a few examples per class.
● Complex Scene Handling: Maintain high performance in dense, cluttered, or partially occluded scenes.
● Interactive Prototype: Develop a user-friendly interface where non-experts can upload an image, provide 1–5 clicks textual labels or examples, and receive object counts.

Key Milestones
1. SOTA Review & Baseline: Literature review and benchmarking of relevant few-shot and detection-based counting methods.
2. Core Model Development: Integrate module for multi-label inputs, focussing on feature disentanglement for similar classes.
3. Improvement of Reliability: Research and implement techniques to improve robustness across diverse image domains.
4. Evaluation & Refinement: Evaluate model against benchmarks/real-world images and refine it.
5. Build Demonstrator: Develop demonstrator to process user uploads and present the model results.

Requirements for Students

Students should have familiarity with Python and modern machine-learning frameworks (PyTorch, Scikit-Learn, OpenCV, Transformers). Experience with Git, cloud-based development, and REST APIs is beneficial. Curiosity, willingness to experiment with state-of-the-art vision models, and the ability to build clean, user-oriented prototypes are essential.

Apply to this project here

To top

Be part of TUM-DI-LAB!

Mentors:

cutting edge knowledge is essential for our lab. Professors, postdocs and doctoral students are welcome as project mentors. Find out here how to become a mentor.

Partners:

Industrial partners are indispensable for TUM-DI-LAB. Find out here how to become a partner

Munich Data Science Institute

TUM-DI-LAB is part of the Munich Data Science Institute (MDSI) since October 2021