Federica Bologna

A portrait of me in Venice Beach, LA at sunset

I’m a PhD candidate in Information Science at Cornell University, where I’m advised by Matthew Wilkens. My research lies at the intersection of natural language processing, healthcare, and gender.

Here are a few research directions I worked on and am currently excited about:

Evaluation of clinical models: After studying the integration of medical alerts within nurses workflows at NYC Health + Hospitals, I have been increasingly interested in evaluating clinical QA models.
Large-scale analysis of patient needs in online health communities: I have analyzed user needs and support strategies in endometriosis online communities, as well perceptions of ablation and excision surgery.
Interaction between gender and literature: I have conducted a randomized experiment with 3,000 participants to measure the causal effect of character gender on reader’s preferences. I have used stylometry to explore style and gender play in Terze Rime by Veronica Franco.

I strive to ground my work through the theoretical frameworks of ethics of care and studying up. I enjoy using a combination of quantitative - NLP, causal inference, statistical analysis - and qualitative methods - surveys, annotations, interviews.

For a more detailed description of my research projects peek below 👀

News

Jun 2025	Starting my Research Intern position at the Allen Institute for AI!
Feb 2025	Paper published in the Journal of Medical Internet Research!
Aug 2024	Paper published in The Journal of Minimally Invasive Gynecology!
Feb 2024	I passed my Candidacy exam! Here is a curated selection of my slide deck.
Nov 2023	Spoke on the main stage at AAGL 2023 about how endometriosis online communities and NLP can tell us how to improve care and reduce diagnostic wait times.

Research Timeline

Recently, I have evaluated the information quality, empathy and actionability of GPT4 responses to endometriosis patient questions. Specifically, I compared GPT4 performance at answering endometriosis questions vs general medical questions, as well as across prompting strategies. We find that prompting GPT-4 to answer medical questions in the role of a patient advocate increases the empathy and actionability but not the information quality of the replies.

Over the past year, I have designed a large-scale randomized survey experiment to measure the causal effect of character gender on reader preferences. Ian Lundberg, Matthew Wilkens, and I found that character gender has a minimal effect on readers’ preferences. These findings contradict a long standing belief in the publishing industry that men and boys are only interested in reading about people of the same gender identity. We are currently working on the paper detailing this project.

In the summer of 2023, under the guidance of Doctor Kristen Pepin, I expanded my previous work on endometriosis online communities by analyzing user sentiment toward mentions of excision or ablation surgery, as well as support type and topics most associated with each surgery. We presented this work at the 2023 global congress of the American Association of Gynecologic Laparoscopists (AAGL) and the findings have been published in the Journal of Minimally Invasive Gynecology.

As PiTech Impact Fellow, I have worked with NYC Health + Hospitals to better integrate medical alerts within nurses workflows. I evaluated whether the recent intervention had successfully decreased the number of overridden alerts, and interviewed nurses at NYC H+H facilities to draw design interventions that better support their work.

Before that, I’ve investigated patient needs and support strategies in endometriosis communities using Natural Language Processing on Reddit with Rosamond Thalken. We find that patients need greater empathy within clinical settings, easier access to appointments, more information on care pathways, and further support for their loved ones. Here is the pre-print!

I have also collaborated with Marilyn Migiel and Giulia Andreoni, to identify the “Unknown Male Author” in Veronica Franco’s poetry collection Terze rime with machine learning. This work is currently under review.

Prior to joining Cornell, I’ve worked as a Research Assistant at the University of Bologna, Italy, with Angelo Di Iorio, Silvio Peroni, and Francesco Poggi. There, I examined the availability of open access bibliographic data for Italian publications across fields (published in QSS), and whether bibliometrics extracted from open access dataset can provide insights on the evaluation of academic researchers (published in Scientometrics).

I received a Master’s degree in Modern, Post-colonial and Comparative Literature at the University of Bologna, Italy, with a thesis on computational approaches to model urban space in science fiction literature. This work resulted in a publication on The Journal of Cultural Analytics.