D1 @ UbiComp 2025
  • Home
  • Call for Papers
  • Organizers
  • Program Committee

On this page

  • Important dates
  • Important links
  • The dataset
  • Paper submission
  • Paper presentation
  • Paper Awards
  • Extended version in journal
  • How to get the datasets
  • Contacts
  • Examples of previous studies
    • References

Call for Papers

The open challenge aims to explore the DiversityOne dataset, one of the largest and most geographically diverse smartphone datasets about everyday life behavior. The dataset combines questionnaires about demographic and psychosocial variables from 18K participants, and passive smartphone sensor data and self-reported annotation from 782 students across eight universities in eight countries.

This dataset is a rich and valuable research resource that can be used to address research questions in multiple fields: machine learning, mobile sensing, computational social science, design with data, and many others. The open challenge offers the opportunity to work on the dataset and gain useful feedback on your research.

We welcome contributions from researchers from diverse backgrounds and geographical provenances. In particular, we welcome contributions that address aspects including, but not limited to:

  • AI/ubiquitous computing/mobile sensing

    • data-centric AI
    • interactive machine learning
    • noisy annotation detection and correction
    • domain adaptation
    • transfer learning
    • activity and mood recognition
    • responsible and ethical AI
  • Computational social science

    • network analysis of social systems
    • sequence analysis of diary data
    • analysis of communities of practices
    • machine learning or rule-based analysis of social behavior
  • Designing with data

    • studies focusing on the design and documentation of the dataset collection
    • studies focusing on the design affordances of the dataset
    • data-centric design
    • user-centered design

Important dates

  • From March 28, 2025 April 3, 20251: Submit your research proposal using the web form and request the datasets that you need to answer your research questions. Please specify that you are requesting the dataset to submit your work to the workshop. The full list of available datasets and documentation is accessible on the data catalog.

  • June 8, 2025: Abstract deadline. Link to submission platform TBA

  • June 15, 2025 23:59 AoE: Submission deadline

  • June 29, 2025: Author notification

  • July 30, 2025: Deadline for the camera-ready version of workshop papers to be included in the ACM DL

  • October 12 or 13, 2015: Full-day Workshop.

Important links

Dataset paper describes the methodology and the research protocol.

Data catalog lists all available data.

Dataset request form to send the research proposal and to request copy of the datasets.

Dataset webpage summarizes the key information about the data.

Conference webpage UbiComp / ISWC 2025

The dataset

To participate, you are expected to use the DiversityOne dataset. This dataset is the result of the large-scale European project “WeNet - The Internet of US”. The dataset paper describes in detail the dataset together with the data collection methodology and statistics. contains data from college students in eight countries: China, Denmark, India, Italy, Mexico, Mongolia, Paraguay, and the United Kingdom. The study followed ethical approval procedures in each of the participating institutions and is compliant with the European General Data Protection Regulation (GDPR).

The dataset contains questionnaire answers from over 18K students, 782 of whom agreed to participate in a longitudinal survey of four weeks. The study used the iLog app to collect data from smartphone sensors such as accelerometer, gyroscope, and GPS, as well as derived information such as notification interactions, app usage, activities, and step counts. During this period, participants reported their activities, locations, social context, and mood, with daily reports on sleep quality and daily expectations.

The combination of sensor data and self-reported data across eight universities worldwide fosters research in ubiquitous computing, mobile sensing, machine learning, computational social science, and design with data. The dataset also opens the possibility for the design of new data-driven studies of diverse human behavior.

Paper submission

Short paper (max 4 pages, excluding references). The submitted works are expected to reflect on, analyze, or test the DiversityOne dataset. Submitted papers should report the work’s motivation, methodology, results, possible future analyses, and include an ethical statement describing potential societal impact.

The paper should follow the UbiComp/ISWC template and guideline. All papers and any supplementary material must be anonymized. Each submission will be reviewed by two reviewers from a panel of experts. All authors are asked to adhere to the Accessible Submission Guidelines.

All accepted publications will be published on the ACM Digital Library as part of the UbiComp 2025 proceedings. At least one author of each accepted paper needs to register for the workshop. During the workshop, each paper will be presented by one of the authors. We are exploring with the conference and workshop chairs the possibility of supporting video/remote presentations and reduced rates for authors unable to travel to the conference to promote diversity and facilitate participation.

Paper presentation

The authors of the accepted paper will be invited to present their work during the workshop. The presentation timing will be defined based on the number of submissions (tentatively, the presentation will be between 10 minutes, with 5 minutes for Q/A and discussion, in which we encourage participants to exchange ideas and approaches). The final timing will be communicated before the workshop.

Paper Awards

Papers will receive a special mention during the workshop in the form of a best paper award and a best paper runner-up award. The evaluation will be made by a committee based on the following criteria:

  • creativity: how original or novel the analysis is;
  • multidisciplinary: how well it combines ideas and approaches from multiple disciplines;
  • presentation: how clear the presentation is and how well written the paper is;
  • impact: how likely is it that the work can lead to impactful results if the paper is further extended.

Extended version in journal

A selection of the accepted papers will be invited to submit an extended version to IEEE Pervasive Computing.

How to get the datasets

  1. Navigate the data catalog. The catalog is available on the data catalog. Please look at the available datasets and documentation. The catalog contains

    • basic datasets (e.g., one single modality such as accelerometer or annotations for one university)
    • dataset bundles of basic sensors (e.g., motion sensors grouping accelerometer and step counter) split by data collection location.
    • projects grouping all data of one single university.

    Any composition of bundle, datasets and university project can be requested coherently with your research proposal.

  2. Proposal submission. All authors must submit a data download request through the web form that will be made available from April 3, 2025 by providing the following information:

    • Names and contact information of the authors working on the dataset and their institution.
    • Description of the research proposal. Provide a description of your idea and how you plan to use the dataset.
    • Select from a list the datasets to download. On the data catalog, you can find and navigate the list of downloadable datasets and navigate the codebooks.

    The key requirements to obtain a copy of the data are the affiliation with a research institution, either private or public, the coherence between the requested data and the research proposal, and the acceptance of the Terms and License Agreement. Key licensing terms include:

    • datasets are used exclusively for research purposes;
    • redistribution of the datasets is prohibited;
    • datasets cannot be publicly shared (e.g., on a website);
    • any attempt to reverse engineer any portion of the data or to re-identify the participants is strictly forbidden and could constitute unlawful processing of personal data.
  3. Dataset download. The proposal request is evaluated by the University of Trento (UNITN), and in case of a positive response, the participants receive an email with the instructions for downloading the dataset after a few days. The requested data are shared with the participants through dedicated storage.

Contacts

For any questions related to the workshop or technical support regarding the datasets, please do not hesitate to reach out to the organizers at datadistribution.knowdive [at] unitn.it.

Examples of previous studies

Previous studies investigated various aspects of high-quality rich datasets that include sensor data and self-reports. Some examples are (for papers based on DiversityOne, we list the sensor modalities used):

  • the use of social media (Giunchiglia et al. 2018)

    Datasets: the authors used the SmartUnitn2 dataset, which has the same modalities and variables as DiversityOne: time diaries, application usage, screen status.

  • the quality of answers and mislabeling (Bontempelli et al. 2020)

    Datasets: authors used SmartUnitn2 dataset, a similar dataset. The corresponding input modalities in DiversityOne are time diaries, acceleration, screen status, airplane mode, gyroscope, ring mode, battery charge, battery level, magnetic field, doze modality, headset plugged in, music playback, location, WiFi network connected to, proximity, WiFi networks available, Bluetooth, running application, notifications, atmospheric pressure (the following modalities are not available in DiversityOne: linear acceleration, gravity, rotation vector, orientation, temperature, humidity, detect incoming and outgoing calls, detect incoming and outgoing SMS).

  • the usefulness of self-reports towards understanding the user’s subjective perspective of the local context (Zhang et al. 2021);

    Datasets: SmartUnitn2 dataset. The corresponding input modalities in DiversityOne are time diaries, acceleration, screen status, airplane mode, gyroscope, ring mode, battery charge, battery level, magnetic field, doze modality, headset plugged in, music playback, activity performed (Google Activity Recognition API), location, WiFi network connected to, proximity, WiFi networks available, Bluetooth, notifications, atmospheric pressure (the following modalities are not available in DiversityOne: linear acceleration, gravity, rotation vector, orientation, temperature, humidity).

  • the impact of COVID on the students’ lives (Girardini et al. 2023)

    Datasets: the authors relied on SmartUnitn2 dataset and DiversityOne in Italy: time diaries.

  • cross-individual activity recognition (Shen et al. 2022);

  • mood inference (Meegahapola et al. 2023)

    Datasets: for all countries: location, Bluetooth, WiFi, cellular, notifications, proximity, activity steps, screen events, user presence, touch events, app events, time diaries.

  • diversity perceptions in a community (Kun et al. 2022);

  • activity recognition (Bouton-Bessac, Meegahapola, and Gatica-Perez 2022)

    Datasets: accelerometer and time diaries data of Denmark, UK, Mongolia, Paraguay and Italy.

  • social context inference while eating (Kammoun, Meegahapola, and Gatica-Perez 2023)

    Datasets: for all countries activity type, step count, location, phone signal, WiFi, Bluetooth, battery, and proximity, notifications, application usage, screen episodes user presence and time diaries.

  • inferring mood-while-eating (Bangamuarachchi et al. 2025)

    Datasets: for all countries: location, Bluetooth, WiFi, cellular, notifications, proximity, activity, steps detector, step counter, screen events, user presence, touch events, app events, time diaries.

  • the generation of contextually rich data with other reference datasets (Giunchiglia and Li 2024).

    Datasets: authors used SmartUnitn2 dataset, a similar dataset. The corresponding input modalities in DiversityOne are location and time diaries.

References

Bangamuarachchi, Wageesha, Anju Chamantha, Lakmal Meegahapola, Haeeun Kim, Salvador Ruiz-Correa, Indika Perera, and Daniel Gatica-Perez. 2025. “Inferring Mood-While-Eating with Smartphone Sensing and Community-Based Model Personalization.” ACM Transactions on Computing for Healthcare (HEALTH). https://arxiv.org/abs/2306.00723.
Bontempelli, Andrea, Stefano Teso, Fausto Giunchiglia, and Andrea Passerini. 2020. “Learning in the Wild with Incremental Skeptical Gaussian Processes.” In IJCAI. https://www.ijcai.org/proceedings/2020/0399.pdf.
Bouton-Bessac, Emma, Lakmal Meegahapola, and Daniel Gatica-Perez. 2022. “Your Day in Your Pocket: Complex Activity Recognition from Smartphone Accelerometers.” In International Conference on Pervasive Computing Technologies for Healthcare, 247–58. Springer. https://www.idiap.ch/\~gatica/publications/BoutonBessacEtAl-ph22.pdf.
Girardini, Nicolò Alessandro, Simone Centellegher, Andrea Passerini, Ivano Bison, Fausto Giunchiglia, and Bruno Lepri. 2023. “Adaptation of Student Behavioural Routines During Covid-19: A Multimodal Approach.” EPJ Data Science 12 (1): 55. https://epjds.epj.org/articles/epjdata/abs/2023/01/13688_2023_Article_429/13688_2023_Article_429.html.
Giunchiglia, Fausto, and Xiaoyue Li. 2024. “Big-Thick Data Generation via Reference and Personal Context Unification.” In ECAI 2024, 1975–84. IOS Press. https://ebooks.iospress.nl/doi/10.3233/FAIA240713.
Giunchiglia, Fausto, Mattia Zeni, Elisa Gobbi, Enrico Bignotti, and Ivano Bison. 2018. “Mobile Social Media Usage and Academic Performance.” Computers in Human Behavior 82: 177–85. https://arxiv.org/abs/2004.01392.
Kammoun, Nathan, Lakmal Meegahapola, and Daniel Gatica-Perez. 2023. “Understanding the Social Context of Eating with Multimodal Smartphone Sensing: The Role of Country Diversity.” In Proceedings of the 25th International Conference on Multimodal Interaction, 604–12. https://arxiv.org/abs/2306.00709.
Kun, Peter, Amalia de Götzen, Miriam Bidoglia, Niels Jørgen Gommesen, and George Gaskell. 2022. “Exploring Diversity Perceptions in a Community Through a q&a Chatbot.” In DRS2022: Bilbao Design Research Society, 1–19. https://arxiv.org/abs/2402.08558.
Meegahapola, Lakmal, William Droz, Peter Kun, Amalia De Götzen, Chaitanya Nutakki, Shyam Diwakar, et al. 2023. “Generalization and Personalization of Mobile Sensing-Based Mood Inference Models: An Analysis of College Students in Eight Countries.” Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies 6 (4): 1–32. https://arxiv.org/abs/2211.03009.
Shen, Qiang, Haotian Feng, Rui Song, Stefano Teso, Fausto Giunchiglia, Hao Xu, et al. 2022. “Federated Multi-Task Attention for Cross-Individual Human Activity Recognition.” In IJCAI, 3423–29. IJCAI. https://www.ijcai.org/proceedings/2022/0475.pdf.
Zhang, Wanyi, Qiang Shen, Stefano Teso, Bruno Lepri, Andrea Passerini, Ivano Bison, and Fausto Giunchiglia. 2021. “Putting Human Behavior Predictability in Context.” EPJ Data Science 10 (1): 42. https://epjds.epj.org/articles/epjdata/abs/2021/01/13688_2021_Article_299/13688_2021_Article_299.html.
Back to top

Footnotes

  1. The data requests will be received starting from April 3rd, instead of March 28th, due to final adjustments in the distribution procedure.↩︎