• Barbara Pernici, Cinzia Cappiello, Carlo Bono, Camilla Sancricca, Tiziana Catarci, Marco Angelini, Matteo Filosa, Matteo Palmonari, Flavio De Paoli, Sonia Bergamaschi, Giovanni Simonini, Angelo Mozzillo, Luca Zecchini, Sustainable quality in data preparation, accepted for publication on ACM Journal on Data and Information Quality, https://doi.org/10.1145/3769120
    • Barbara Pernici, Cinzia Cappiello, Edoardo Ramalli, Matteo Palmonari, Federico Belotti, Flavio De Paoli, Angelo Mozzillo, Luca Zecchini, Giovanni Simonini, Sonia Bergamaschi, Tiziana Catarci, Matteo Filosa, Marco Angelini, Dario Benvenuti (2024). The Future of Sustainable Data Preparation. SEBD 2024: 486-497  https://ceur-ws.org/Vol-3741/paper27.pdf
    • Cremaschi, M., Belotti, F., D’Souza, J., & Palmonari, M. (2025). MammoTab 25: A large-scale dataset for semantic table interpretation—Training, testing, and detecting weaknesses. In Proceedings of the 24th International Semantic Web Conference (ISWC 2025) link OA https://hdl.handle.net/10281/576467 
    • Ghilardi, D., Belotti, F., Molinari, M., Ma, T., & Palmonari, M. (2025). Group-SAE: Efficient Training of Sparse Autoencoders for Large Language Models via Layer Groups. In The 2025 Conference on Empirical Methods in Natural Language Processing (EMNLP 2025).  link OA https://aclanthology.org/2025.emnlp-main.942
    • Bono, C., Belotti, F., & Palmonari, M. (2025). Efficient uncertainty estimation for LLM-based entity linking in tabular data. In Proceedings of the 20th International Workshop on Ontology Matching (OM 2025), co-located with the 24th International Semantic Web Conference (ISWC 2025), Nara, Japan, November 2–3, 2025 (to appear). https://doi.org/10.1007/978-981-96-7238-7_11 link OA https://arxiv.org/abs/2510.01251
    • Alidu, A., Ciavotta, M., & De Paoli, F. (2025, February). SemT: A Framework for Enhancing Tabular Data Through Enrichment-as-a-Service. In European Conference on Service-Oriented and Cloud Computing (pp. 33-39). Cham: Springer Nature Switzerland. https://link.springer.com/chapter/10.1007/978-3-031-84617-5_3
    • Alidu, A., Ciavotta, M., & De Paoli, F. (2025, July). Prompt2DAG: A Modular Prompting Approach for Democratizing Data Pipeline Generation. In 2025 IEEE International Conference on Software Services Engineering (SSE) (pp. 1-11). IEEE. DOI: 10.1109/SSE67621.2025.00010
    • Pérez-Messina, I., Angelini, M., Ceneda, D., Tominski, C. and Miksch, S. (2025), Coupling Guidance and Progressiveness in Visual Analytics. Computer Graphics Forum, 44: e70115. https://doi.org/10.1111/cgf.70115
    • Matteo Filosa; Alexandra Plexousaki; Matteo Di Stadio; Francesco Bovi; Dario Benvenuti; Tiziana Catarci, and Marco Angelini, “TraVIS: A User Trace Analyzer to Support User-Centered Design of Visual Analytics Solutions,” in IEEE Transactions on Visualization and Computer Graphics, doi: 10.1109/TVCG.2025.3546863link
    • Francesco Pugnaloni, Luca Zecchini, Matteo Paganelli, Matteo Lissandrini, Felix Naumann, and Giovanni Simonini. 2025. Table Overlap Estimation through Graph Embeddings. Proc. ACM Manag. Data 3, 3 (SIGMOD), Article 228 (June 2025), 25 pages. https://doi.org/10.1145/3725365
    • Luca Zecchini, Vasilis Efthymiou, Felix Naumann, and Giovanni Simonini. Deduplicated Sampling On-Demand. PVLDB, 18(8): 2482 – 2495, 2025. doi:10.14778/3742728.3742742
    • Luca Zecchini, Ziawasch Abedjan, Vasilis Efthymiou, and Giovanni Simonini. RadlER: Deduplicated Sampling On-Demand. PVLDB, 18(12):5319 – 5322, 2025. doi:10.14778/3750601.3750661 
    • Edoardo Ramalli, Carlo Alberto Bono, Camilla Sancricca, Cinzia Cappiello, Marco Comuzzi, Barbara Pernici, Monica Vitali, Entity ablation of knowledge graphs: Impact on information quality and sustainability, accepted for publication on Future Generation Computing Systems, Volume 175, February 2026, 108063 link
    • Mozzillo, A., Zecchini, L., Gagliardelli, L., Aslam, A., Bergamaschi, S., Simonini, G. (2025). Evaluation of Dataframe Libraries for Data Preparation on a Single Machine. EDBT 2025 https://arxiv.org/abs/2312.11122
    • C. Bono, B. Pernici, Quality-informed Active Learning on Social Media in Crisis Scenarios, HHAI 2025: The 4th International Conference Series on Hybrid Human-Artificial Intelligence, June 9-13, 2025 (Pisa, Italy) https://ebooks.iospress.nl/volumearticle/74992
    • Amir Hossein Mohsen Nezhad Baravati, Martina Viganò, Carlo Alberto Bono, Barbara Pernici, Enhancing User Experience with Topic-Based Message Retrieval in Telegram, The 40th ACM/SIGAPP Symposium On Applied Computing, Catania, April 2025 https://dl.acm.org/doi/pdf/10.1145/3672608.3707753
    • Pozzi, R., Barbera, V., Alva Principe, R., Giardini, D., Rubini, R., & Palmonari, M. (2025). Combining knowledge graphs and nlp to analyze instant messaging data in criminal investigations. In International Conference on Web Information Systems Engineering (pp. 427-442). Springer, Singapore. https://doi.org/10.1007/978-981-96-0567-5_30
    • Pozzi, R., Palmonari, M., Coletta, A., Bellomarini, L., Lehmann, J., & Vahdati, S. (2025). ReFactX: Scalable Reasoning with Reliable Facts via Constrained Generation. ISWC 2025. arXiv preprint arXiv:2508.16983. https://arxiv.org/pdf/2508.16983 link OA https://arxiv.org/abs/2508.16983
    • A. Ulmer, M. Angelini, J. -D. Fekete, J. Kohlhammer and T. May, “A Survey on Progressive Visualization,” in IEEE Transactions on Visualization and Computer Graphics, vol. 30, no. 9, pp. 6447-6467, Sept. 2024, doi: 10.1109/TVCG.2023.3346641
    • Aslam A, Simonini G, Gagliardelli L, Zecchini L, Bergamaschi S. (2024). Stream-aware indexing for distributed inequality join processing. Information Systems. 2024 Nov 1;125:102425. https://doi.org/10.1016/j.is.2024.102425, link OA 
    • Filosa, M., Plexousaki, A., Benvenuti, D., Catarci, T., Angelini, M. (2024). InterView: A System to Support Interaction-Driven Visualization Systems Design. In: Lárusdóttir, M.K., Naqvi, B., Bernhaupt, R., Ardito, C., Sauer, S. (eds) Human-Centered Software Engineering. HCSE 2024. Lecture Notes in Computer Science, vol 14793. Springer, Cham.  https://dl.acm.org/doi/10.1007/978-3-031-64576-1_23, link
    • Zecchini, L., Bleifuß, T., Simonini, G., Bergamaschi, S., & Naumann, F. (2024). Determining the Largest Overlap between Tables. Proceedings of the ACM on Management of Data, 2(1), 1-26. https://dl.acm.org/doi/10.1145/3639303
    • Zecchini, L., Bleifuß, T., Simonini, G., Bergamaschi, S., & Naumann, F. (2024). Overlap-Based Duplicate Table Detection. In CEUR Workshop Proceedings (Vol. 3741, pp. 643-652). https://ceur-ws.org/Vol-3741/paper24.pdf
    • Mattia Salnitri, Edoardo Ramalli, Barbara Pernici, Towards a policy tuning method for data ecosystems, IEEE Symposium on Services for Data & Model Ecosystems (SDMEs), Shenzhen, China, July 2024 (invited paper) https://ieeexplore.ieee.org/document/10707453
    • Giaccaglia, P., Bono, C. A.,  Pernici, B. (2024). Enhancing Emergency Post Classification through Image Information Amplification via Large Language Models. ISCRAM Proceedings, 21. https://ojs.iscram.org/index.php/Proceedings/article/view/38
    • Marco Comuzzi, Sungkyu Kim, Jonghyeon Ko, Musa Salamov, Cinzia Cappiello,  Barbara Pernici, On the Impact of Low-Quality Activity Labels in Predictive Process Monitoring,  ICPM Workshop on ML4PM – Leveraging Machine learning in Process Mining, Oct. 2024 https://re.public.polimi.it/retrieve/3795fb05-25ca-45d5-b1ae-e49dd83c65b1/ICPM_2024_paper_175_submitted.pdf
    • Alidu, A., Ciavotta, M., & Paoli, F. D. (2024, December). LLM-based DAG creation for data enrichment pipelines in semt framework. In International Conference on Service-Oriented Computing (pp. 131-143). Singapore: Springer Nature Singapore. https://doi.org/10.1007/978-981-96-7238-7_11
    • Moiraghi, F., Palmonari, M., Allavena, D., & Morando, F. (2024, July). Zero-shot hierarchical classification on the common procurement vocabulary taxonomy. In 2024 IEEE 48th Annual Computers, Software, and Applications Conference (COMPSAC) (pp. 273-278). IEEE. https://arxiv.org/pdf/2405.09983
    • De Paoli, F., Avogadro, R., Ripamonti, M., & Palmonari, M. (2024, January). Interactive Enrichment of Tabular Data with SemTUI. In ISWC (Posters/Demos/Industry). https://ceur-ws.org/Vol-3828/paper36.pdf
    • Agazzi, R., Alva Principe, R., Pozzi,  R., Ripamonti, M., & Palmonari, M. DAVE: A Framework for Assisted Analysis of Document Collections in Knowledge-Intensive Domains. IJCAI Demo Papers https://www.ijcai.org/proceedings/2025/1246 
    • Rubini, R., Vimercati, M., & Palmonari, M. (2024, December). PROMET: Parameter-Efficient Few-Shot Fine-Grained Entity Typing with Implicit Mask Filling. In 2024 IEEE/WIC International Conference on Web Intelligence and Intelligent Agent Technology (WI-IAT) (pp. 141-149). IEEE. DOI: 10.1109/WI-IAT62293.2024.00027