Call for Papers
The open challenge aims to explore the DiversityOne dataset, one of the largest and most geographically diverse smartphone datasets about everyday life behavior. The dataset combines questionnaires about demographic and psychosocial variables from 18K participants, and passive smartphone sensor data and self-reported annotation from 782 students across eight universities in eight countries.
This dataset is a rich and valuable research resource that can be used to address research questions in multiple fields: machine learning, mobile sensing, computational social science, design with data, and many others. The open challenge offers the opportunity to work on the dataset and gain useful feedback on your research.
We welcome contributions from researchers from diverse backgrounds and geographical provenances. In particular, we welcome contributions that address aspects including, but not limited to:
AI/ubiquitous computing/mobile sensing
- data-centric AI
- interactive machine learning
- noisy annotation detection and correction
- domain adaptation
- transfer learning
- activity and mood recognition
- responsible and ethical AI
Computational social science
- network analysis of social systems
- sequence analysis of diary data
- analysis of communities of practices
- machine learning or rule-based analysis of social behavior
Designing with data
- studies focusing on the design and documentation of the dataset collection
- studies focusing on the design affordances of the dataset
- data-centric design
- user-centered design
Important dates
From
March 28, 2025April 3, 20251: Submit your research proposal using the web form and request the datasets that you need to answer your research questions. Please specify that you are requesting the dataset to submit your work to the workshop. The full list of available datasets and documentation is accessible on the data catalog.June 8, 2025: Abstract deadline. Link to submission platform TBA
June 15, 2025 23:59 AoE: Submission deadline
June 29, 2025: Author notification
July 30, 2025: Deadline for the camera-ready version of workshop papers to be included in the ACM DL
October 12 or 13, 2015: Full-day Workshop.
Important links
Dataset paper describes the methodology and the research protocol.
Data catalog lists all available data.
Dataset request form to send the research proposal and to request copy of the datasets.
Dataset webpage summarizes the key information about the data.
The dataset
To participate, you are expected to use the DiversityOne dataset. This dataset is the result of the large-scale European project “WeNet - The Internet of US”. The dataset paper describes in detail the dataset together with the data collection methodology and statistics. contains data from college students in eight countries: China, Denmark, India, Italy, Mexico, Mongolia, Paraguay, and the United Kingdom. The study followed ethical approval procedures in each of the participating institutions and is compliant with the European General Data Protection Regulation (GDPR).
The dataset contains questionnaire answers from over 18K students, 782 of whom agreed to participate in a longitudinal survey of four weeks. The study used the iLog app to collect data from smartphone sensors such as accelerometer, gyroscope, and GPS, as well as derived information such as notification interactions, app usage, activities, and step counts. During this period, participants reported their activities, locations, social context, and mood, with daily reports on sleep quality and daily expectations.
The combination of sensor data and self-reported data across eight universities worldwide fosters research in ubiquitous computing, mobile sensing, machine learning, computational social science, and design with data. The dataset also opens the possibility for the design of new data-driven studies of diverse human behavior.
Paper submission
Short paper (max 4 pages, excluding references). The submitted works are expected to reflect on, analyze, or test the DiversityOne dataset. Submitted papers should report the work’s motivation, methodology, results, possible future analyses, and include an ethical statement describing potential societal impact.
The paper should follow the UbiComp/ISWC template and guideline. All papers and any supplementary material must be anonymized. Each submission will be reviewed by two reviewers from a panel of experts. All authors are asked to adhere to the Accessible Submission Guidelines.
All accepted publications will be published on the ACM Digital Library as part of the UbiComp 2025 proceedings. At least one author of each accepted paper needs to register for the workshop. During the workshop, each paper will be presented by one of the authors. We are exploring with the conference and workshop chairs the possibility of supporting video/remote presentations and reduced rates for authors unable to travel to the conference to promote diversity and facilitate participation.
Paper presentation
The authors of the accepted paper will be invited to present their work during the workshop. The presentation timing will be defined based on the number of submissions (tentatively, the presentation will be between 10 minutes, with 5 minutes for Q/A and discussion, in which we encourage participants to exchange ideas and approaches). The final timing will be communicated before the workshop.
Paper Awards
Papers will receive a special mention during the workshop in the form of a best paper award and a best paper runner-up award. The evaluation will be made by a committee based on the following criteria:
- creativity: how original or novel the analysis is;
- multidisciplinary: how well it combines ideas and approaches from multiple disciplines;
- presentation: how clear the presentation is and how well written the paper is;
- impact: how likely is it that the work can lead to impactful results if the paper is further extended.
Extended version in journal
A selection of the accepted papers will be invited to submit an extended version to IEEE Pervasive Computing.
How to get the datasets
Navigate the data catalog. The catalog is available on the data catalog. Please look at the available datasets and documentation. The catalog contains
- basic datasets (e.g., one single modality such as accelerometer or annotations for one university)
- dataset bundles of basic sensors (e.g., motion sensors grouping accelerometer and step counter) split by data collection location.
- projects grouping all data of one single university.
Any composition of bundle, datasets and university project can be requested coherently with your research proposal.
Proposal submission. All authors must submit a data download request through the web form that will be made available from April 3, 2025 by providing the following information:
- Names and contact information of the authors working on the dataset and their institution.
- Description of the research proposal. Provide a description of your idea and how you plan to use the dataset.
- Select from a list the datasets to download. On the data catalog, you can find and navigate the list of downloadable datasets and navigate the codebooks.
The key requirements to obtain a copy of the data are the affiliation with a research institution, either private or public, the coherence between the requested data and the research proposal, and the acceptance of the Terms and License Agreement. Key licensing terms include:
- datasets are used exclusively for research purposes;
- redistribution of the datasets is prohibited;
- datasets cannot be publicly shared (e.g., on a website);
- any attempt to reverse engineer any portion of the data or to re-identify the participants is strictly forbidden and could constitute unlawful processing of personal data.
Dataset download. The proposal request is evaluated by the University of Trento (UNITN), and in case of a positive response, the participants receive an email with the instructions for downloading the dataset after a few days. The requested data are shared with the participants through dedicated storage.
Contacts
For any questions related to the workshop or technical support regarding the datasets, please do not hesitate to reach out to the organizers at datadistribution.knowdive [at] unitn.it.
Examples of previous studies
Previous studies investigated various aspects of high-quality rich datasets that include sensor data and self-reports. Some examples are:
- the use of social media (Giunchiglia et al. 2018);
- the quality of answers and mislabeling (Bontempelli et al. 2020)
- the usefulness of self-reports towards understanding the user’s subjective perspective of the local context (Zhang et al. 2021);
- the impact of COVID on the students’ lives (Girardini et al. 2023);
- cross-individual activity recognition (Shen et al. 2022);
- mood inference (Meegahapola et al. 2023);
- diversity perceptions in a community (Kun et al. 2022);
- activity recognition (Bouton-Bessac, Meegahapola, and Gatica-Perez 2022);
- social context inference while eating (Kammoun, Meegahapola, and Gatica-Perez 2023);
- inferring mood-while-eating (Bangamuarachchi et al. 2025);
- the generation of contextually rich data with other reference datasets (Giunchiglia and Li 2024).
References
Footnotes
The data requests will be received starting from April 3rd, instead of March 28th, due to final adjustments in the distribution procedure.↩︎