“Learning to Quantify: Methods and Applications” is a tutorial + workshop event co-located with the ECML/PKDD 2024 conference, and will take place on Friday, September 13, 2024, in Vilnius, Lithuania.
Learning to Quantify (LQ - also known as “quantification“, or “supervised prevalence estimation“, or “class prior estimation“, or “unfolding”) is the task of training class prevalence estimators via supervised learning. In other words, the task of these trained models is to estimate, given an unlabelled sample of data items and a set of classes, the prevalence (i.e., relative frequency) of each such class in the sample.
LQ is interesting in all applications of classification in which the final goal is not determining which class (or classes) individual unlabelled data items belong to, but estimating the percentages of data items that belong to the classes of interest, i.e., estimating the distribution of the unlabelled data items across the classes. Example disciplines whose interest in labelling data items is at the aggregate level (rather than at the individual level) are the social sciences, political science, market research, ecological modelling, and epidemiology.
While LQ may in principle be solved by classifying each data item in the sample and counting how many such items have been labelled with a certain class, it has been shown that this “classify and count” method yields suboptimal quantification accuracy. As a result, quantification is now no longer considered a mere byproduct of classification, and has evolved as a task of its own.
The goal of the Learning to Quantify tutorial is to provide an introduction to Learning to Quantify that presents its rationale, applications, methods, and evaluation measures and methodologies, to researchers who might want to become up-to-speed on this topic. A book on Learning to Quantify will also be distributed to participants, along with the slides used by the instructors.
The goal of the LQ 2024 workshop is to bring together all researchers interested in methods, algorithms, evaluation measures, evaluation protocols, and methodologies for LQ, as well as practitioners interested in the practical application of the above to managing large quantities of data. The workshop will feature presentations of submitted papers, presentations of the results of the 2024 Learning to Quantify data challenge, and a final open discussion on “what’s next in Learning to Quantify”.
LQ 2024 is supported by the SoBigData++ project, funded by the European Commission (Grant 871042) under the H2020 Programme INFRAIA-2019-1, by the AI4Media project, funded by the European Commission (Grant 951911) under the H2020 Programme ICT-48-2020, and by the QuaDaSh project, funded by the European Commission under the NextGenerationEU program. The organizers’ opinions do not necessarily reflect those of the European Commission.