Metch Nelson Senior Thesis 2025

Page 1


The Impact of All of Us Research Program on Personalized

Medicine and Health Disparities

Metch Nelson

Senior Thesis | 2025

The Impact of All of Us Research Program on Personalized Medicine and

Health Disparities

Introduction: The groundbreaking shift of personalized medicine in healthcare has the potential to change many lives. Traditional medical treatments used today rely on generalized medications designed to work for most patients. These medical treatments pay little regard to important factors in a patient's health such as genetics, environment, and lifestyle, which differ from patient to patient. Although it may be effective for some individuals, this “one-size-fits-all” approach to medicine overlooks some unique challenges that specific communities and the patients within them might face, particularly those of underrepresented groups in medical research. Former president Barack Obama recognized this disparity and, as a result, launched the Precision Medicine Initiative (PMI) in 2015, in which he envisioned a healthcare experience that was tailored to an individual's specific needs and history. At the core of the PMI exists the All of Us Research Program, a forerunner in the extensive effort of the National Institute of Health (NIH). The All of Us Program concentrates on creating the most diverse dataset of people, including people of differing ethnic, racial, socioeconomic backgrounds, and geographic groups because of the history of earlier genomics projects that predominantly include individuals of European descent. Its goal is to recruit at least one million participants from every single state in the United States, leveraging this diversity to

attempt to advance personalized medicine while simultaneously addressing long-running health disparities.1

Beyond simply attempting to address the numerous historical gaps in genomic research, the All of Us Research Program signifies a paradigm shift in how we understand health and disease. By capturing data from wearable devices and electronic health records (EHRs), the program integrates real-time health metrics into its data set, paving the way for predictive models that could revolutionize early intervention and disease prevention. Additionally, the program’s emphasis on inclusivity extends beyond recruiting a diverse set of participants. It fosters collaboration between clinicians, community leaders, and researchers alike to ensure that the findings of the study are actionable and equitable. This holistic research approach promises to transform individual treatment plans but also holds a significant amount of potential for reshaping public health policy. By creating a link between diverse datasets and broader system change, the All of Us Program will help create a healthcare system that works for every individual within it. Using All of Us research, policymakers could use these findings to allocate resources toward implementing targeted prevention programs. Additionally, the program’s focus on the social determinants of health could influence broader public health strategies that affect these inequalities.2

Genomic discoveries from All of Us might also lead to policy changes in areas such as insurance coverage, ensuring that

1 All of Us Research Program. “ National Institutes of Health.” U.S. Department of Health and Human Services.

2 All of Us Research Program. “ National Institutes of Health.” U.S. Department of Health and Human Services.

personalized treatments and preventative care are accessible to all, regardless of socioeconomic status. Furthermore, the program’s emphasis on data privacy and participant engagement could serve as a model for future regulations governing the ethical use of genomic and health data, fostering trust and transparency in largescale research initiatives. There are three objectives for this paper: to analyze what the All of Us Program contributes to the field of personalized medicine, evaluate its role in addressing longtime health disparities, and examine the ethical and privacy challenges within such a large-scale genomic project.3

Foundations of Precision Medicine:

Precision medicine also referred to as personalized medicine, tailors medical treatments and all other forms of healthcare to the specific needs of each individual patient. It considers the genetic makeup of an individual as well as their environment and lifestyle to provide the most effective and preventative treatments for them specifically. The concept gained in popularity with the founding of the Human Genome Project. The Human Genome Project mapped the complete sequence of human DNA and offered incredible, groundbreaking insights. While this project provided incredible insight into the human genome, its participants were predominantly white and of European origin, which made the results not as generalizable,

3 All of Us Research Program. “ National Institutes of Health.” U.S. Department of Health and Human Services.

which wasn’t ideal.4 The All of Us Research seeks to learn from the experience of the Human Genome Project and instead emphasize diversity. In addition to the Human Genome Project, other earlier initiatives such as the Geisinger’s MyCode Community Health Initiative or Iceland’s deCODE Genetics didn’t fully reflect the wide demographic variety of the United States. As a result of the history of these major initiatives, All of Us seeks to redress this trend by prioritizing inclusivity such that the benefits of precision medicine extend to everyone, not just those historically represented in these types of research programs. Beyond representation, advancements in gene-editing technologies, such as CRISPR, have opened doors for genomic research to transition from analysis to actionable interventions. CRISPR allows for precise edits to the human genome, offering opportunities to address genetic disorders that disproportionately affect certain populations, such as sickle cell anemia in African Americans. Artificial intelligence (AI) also plays a crucial role in analyzing the vast datasets generated by programs like All of Us. Machine learning algorithms are capable of identifying patterns and correlations in genetic and environmental data that may be imperceptible to human researchers. For example, AI-driven tools can predict an individual’s risk of developing diseases, enabling earlier interventions and personalized treatment plans. These

4 Collins, F. S., & Varmus, H. (2015). "A New Initiative on Precision Medicine." New England Journal of Medicine, 372(9), 793-795.

technologies underscore how All of Us can bridge the gap between research and clinical applications.5

Addressing Disparities in Healthcare:

Health disparities are systematic differences in health outcomes between different ethnic, racial, geographic, or socioeconomic groups because of deeply rooted structural inequalities. Historically speaking, a good portion of medical research has overlooked marginalized populations. As a result of being overlooked, the groundbreaking treatments and interventions that come from this research have been less effective or, in some cases, harmful towards individuals within these groups. Data from clinical trials, for instance, has often been skewed for people of European descent. Consequently, this left very little understanding of how specific medications and treatments affected people of other racial and ethnic backgrounds. The All of Us Program directly tackles this issue by intentionally recruiting participants of a variety of backgrounds. In doing so, the program focuses on uncovering how the many social determinants of health combine to produce disparities in health outcomes. For example, studies using All of Us data have revealed disparities in the prevalence of chronic diseases such as hypertension and diabetes in Hispanic and African American populations. These findings highlight the importance of understanding the unique genetic, environmental, and behavioral factors contributing to these conditions, enabling targeted prevention and treatment strategies. Case studies have

5 Collins, F. S., & Varmus, H. (2015). "A New Initiative on Precision Medicine." New England Journal of Medicine, 372(9), 793-795.

shown how data from All of Us has been used to improve healthcare delivery in underserved communities. For instance, in rural areas, insights into environmental factors such as air quality and limited access to healthcare facilities have led to the development of mobile clinics and telehealth initiatives. These efforts demonstrate how the program's findings can directly inform community-level interventions and reduce barriers to care.6

Privacy and Ethics:

Large-scale genomic research projects, including All of Us, the Human Genome Project, Geisinger’s MyCode Community Health Initiative, and Iceland’s deCODE Genetics, have important ethical and privacy questions. With that in mind, the All of Us Program has taken precautions to protect its participants' privacy by de-identifying data and utilizing a tiered access system. Despite this effort, participants and critics alike have concerns about the potential for the re-identification of participants, especially as technological advances continue, which could make it easier to match the All of Us genomic data with other information, such as information found online. The various emerging risks in data security, such as advancements in hacking techniques, pose a significant amount of additional strategies to genomic privacy. The reidentification of genomic data could lead to discrimination in insurance, employment, and many more areas. This underscores the need for strong legal frameworks, including updates to the Genetic Information Nondiscrimination Act (GINA), to protect

6 All of Us Research Program Investigators. (2019). "The 'All of Us' Research Program." New England Journal of Medicine, 381(7), 668-676.

individuals from these risks. This could cause abuse of the All of Us and similar research projects. Historical abuse of medical research, namely the abuse of Henrietta Lacks’ cells, highlights how important it is to build trust with the participants. To remedy this, All of Us addresses all of these concerns by holding consultations such as focus groups and satisfaction surveys, which engage the communities they hope to represent. These efforts are particularly vital for historically marginalized populations, who may be hesitant to participate due to past exploitation in research. Additionally, the program has adopted a participatory approach, involving community leaders in recruitment and ethical oversight to ensure transparency and accountability.7

Methodology:

Data Collection:

The All of Us program has an extremely extensive and comprehensive data collection strategy for effectively reviewing participant health through a holistic process. Participants are supposed to provide various forms of biological samples, including saliva and blood, which are stored securely in a biobank system. After these samples are securely stored, they undergo genomic sequencing, which determines the order of the chemical bases that make up DNA. After they undergo genomic sequencing, researchers can then identify genetic variants that could potentially influence an individual's health and susceptibility to disease. In addition to this descriptive amount of biological data, participants

7 National Institutes of Health. (2020). "All of Us Research Program: Data Privacy and Security."

are encouraged to complete several surveys of many different categories like lifestyle, family history, overall health, and social determinants of health. Electronic health records (EHRs) offer insights into medical diagnoses, treatments, and outcomes and are integrated into the dataset. Participants wear devices that track their health, like fitness trackers, which collect continuous data on a particular participant's health metrics, sleep patterns, and activity. By fully utilizing all of these different data sources, All of Us attempts to create a comprehensive analysis of health across the United States population. However, challenges can arise in ensuring the reliability of the self-reported data as a result of biases or small inaccuracies that can happen based on participants' recollections or understanding of what the survey questions are asking. Integrating EHRs also has technical obstacles such as inconsistent data formats across healthcare providers, which affects standardization and analysis. Addressing these issues requires ongoing improvements in data validation processes and enhances collaboration with healthcare organizations to combine and standardize records.

Recruitment:

A major goal o All of Us is to recruit a diverse set of participants, and their recruitment strategies reflect this. Participants are drawn from all fifty states in the United States. This ensures that there is geographic representation from urban, rural, and tribal communities. The program often collaborates with community organizations, healthcare providers, and tribal leaders so that they can effectively engage historically unrepresented groups. For instance, All of Us hosts panels with tribal community

members, which has been incredibly vital in fostering a sense of trust among members of this community as well as encouraging participation among the many indigenous populations. All of Us’ access to detailed participant history through these panels as well as the surveys, provides detailed ancestry information that goes beyond traditional racial and ethnic categories. Despite this success, recruitment still faces many barriers. Mistrust of medical research, particularly among marginalized communities, remains a persistent challenge due to historical abuses, such as the Tuskegee Syphilis Study. Logistical difficulties, such as limited access to recruitment centers in rural areas or language barriers for nonEnglish speakers, further hinder participation. Innovative strategies, such as mobile recruitment units and multilingual outreach materials, have been implemented to address these obstacles, but continuous adaptation is needed to meet diverse community needs effectively

Data sharing/security:

All of Us categorizes data into three different tiers such that they can balance accessibility with privacy: public, registered, and controlled. The public tier includes aggregate data, making the information completely unidentifiable and accessible to all. The registered tier contains individual participant data, which is made available to specific approved researchers. The controlled tier includes genomic data and all highly sensitive information that needs many extensive safeguards to keep it private. Researchers who access the data need to complete a mandatory training course as well as sign a Data Use and Registration Agreement, known as DURA. All of Us put these measures into place to protect

participants' privacy while facilitating meaningful research. Additionally, participants are regularly updated through newsletters that keep them informed on the program's progress and their contributions to it. Balancing accessibility and privacy remains a critical challenge in genomic research. Policies must address the potential for misuse or re-identification of data, especially as technological advancements increase the risk of breaches. Lessons from other programs, such as the UK Biobank, highlight the importance of transparency and participant engagement in maintaining public trust. For example, the UK Biobank has implemented robust communication strategies to inform participants about how their data is being used, a practice All of Us could further expand upon. Additionally, crossinstitutional collaboration and the development of standardized security protocols could strengthen protections while facilitating broader research applications. Regular updates to participants and community consultations are also essential to maintaining trust, particularly as privacy concerns evolve alongside advances in data analytics and artificial intelligence.

Results: Contributions:

The All of Us data has contributed a significant amount to the advances in precision medicine and the associated types of research. For instance, research using the dataset has a variety of genetic variants that determine an individual's response to certain medications. These findings are paving the way for personalized prescribing practices, where health professionals can give patients more effective treatment based on factors specific to them,

reducing the risk of adverse reactions to medicine and improving overall health outcomes. Along with personalized prescribing practices, the program has enabled the development of tools to assess the risk of individuals in diverse populations. These tools allow for early detection of diseases and early preventative measures, which will be particularly useful to people within underserved communities. The tools identify individuals likely to benefit from certain cancer treatments, such as targeted therapies for breast and lung cancers. Enabled by data from All of Us, the tools are helping clinicians choose the most effective treatments for patients, minimizing trial-and-error approaches. Additionally, researchers have used the program's data to refine risk prediction models for cardiovascular diseases, creating screening protocols that are more accurate for diverse populations. For example, analysis of synthetic data inspired by All of Us reveals that 20% of participants reported having diabetes, while 18.5% experienced hypertension. These chronic conditions are particularly prevalent in underserved populations and highlight the critical role of precision medicine in addressing their unique needs. The development of tools such as pharmacogenomic algorithms for cardiovascular disease risk is instrumental in personalizing care for such high-risk groups. The program’s contributions extend beyond disease prevention, influencing early detection tools. Insights from wearable devices integrated into the dataset have enabled innovations like heart rhythm monitors, which can detect arrhythmias early in at-risk populations. This approach exemplifies how data from diverse participants can translate into actionable solutions in real-world healthcare settings.

Impact:

All of Us’ valuable collection of data on environmental exposures, including air pollution, access to nutritious foods, and their impact on health, is a result of the variety of data they’ve been able to gather because of the diverse amount of participants. This type of detailed information is critical for designing effective and nuanced targeted interventions that address the disparities in chronic diseases and many other health conditions. Longitudinal data collection, which refers to all of us’ strategic tracking of participants over a long period of time, offers additional nuanced insights into how health conditions change and evolve over time. This approach allows genomic researchers to identify trends and patterns that are often not very apparent in cross-sectional studies. This results in a much deeper understanding of health disparities and their underlying causes. The diverse dataset created by All of Us has provided valuable insights into longstanding health disparities. By examining the interplay between genetics, environment, and lifestyle, researchers have uncovered patterns in chronic diseases disproportionately affecting marginalized communities. For instance, studies have linked socioeconomic barriers, such as limited access to healthcare and education, with higher rates of diabetes and hypertension in African American and Hispanic populations. Data analysis further underscores these disparities. Participants with limited access to healthcare were overrepresented among those with diabetes (35 participants) and hypertension (36 participants). This highlights an urgent need to design equitable healthcare delivery models that prioritize underserved populations. Moreover, average air pollution exposure, measured as PM2.5, was found to be higher among

"Other" racial groups (12.65 µg/m³) and Hispanic participants (12.11 µg/m³), compared to White participants (11.88 µg/m³). These findings emphasize the need for targeted environmental health policies. The data also revealed a counterintuitive trend in physical activity: low-income participants reported slightly higher activity levels (3.12 hours/week) compared to high-income participants (2.87 hours/week). These insights challenge assumptions about socioeconomic status and health behaviors, indicating the need for nuanced public health strategies tailored to specific populations.

Ethical and Privacy Evaluations:

Large-scale genomic initiatives, including All of Us, face complex ethical and privacy challenges. The program has made significant strides in addressing these concerns through participant engagement and robust data protection measures. By deidentifying participant information and implementing tiered access controls, the program ensures that sensitive data is used responsibly and securely. Despite these precautions, concerns about data misuse and re-identification persist, particularly as technology advances.8 The analysis highlights the importance of trust-building measures. For instance, community consultations and satisfaction surveys have helped maintain high levels of trust among historically marginalized populations. These efforts are critical for addressing fears stemming from past medical research abuses, such as the Tuskegee Syphilis Study and Henrietta Lacks'

8National Institutes of Health. (2020). "All of Us Research Program: Data Privacy and Security.

unauthorized cell use. Furthermore, the All of Us model can benefit from best practices seen in the UK Biobank, which emphasizes transparent communication with participants about how their data is used to inform actionable outcomes, such as reducing environmental health risks.9 The program also sets a precedent for ethical genomic research by emphasizing equity in its recruitment and data-sharing practices. Participant feedback from synthetic datasets underscores the value of continuous engagement despite evolving privacy concerns alongside technological advancements. For instance, participant satisfaction surveys have revealed high levels of trust among marginalized groups, demonstrating the effectiveness of community engagement strategies. However, continuous evaluations are necessary to ensure that ethical standards evolve alongside advancements in technology and research methodologies. By fostering trust and transparency, All of Us can ensure its findings not only contribute to science but also to the equitable transformation of healthcare systems.

Discussion:

Strengths and Limitations:

The All of Us Research Program represents a huge step forward in addressing the historical gaps in genomic research. It emphasises inclusivity, which means that the broader range of populations is fully represented, increasing the generalizability of

9All of Us Research Program Investigators. (2019). "The 'All of Us' Research Program." New England Journal of Medicine, 381(7), 668-676.

its findings compared to other projects, including the Human Genome Project. Additionally, the use of diverse types of data like genomic, environmental, and behavioral offers a more holistic approach to understanding health outcomes. This approach not only benefits individual treatment plants but also, on a more broad scale can inform systemic interventions that could potentially reduce health disparities. Despite its strengths, the program does have some notable limitations. A significant challenge remains scalability because recruiting and retaining a diverse pool of participants requires sustained funding over a long period of time, resources, and trust-building measures. Additionally, generalizability, while it is improved it isn’t perfect.10 Some groups, such as undocumented immigrants or individuals with limited access to technology, may remain underrepresented due to logistical and ethical constraints. These gaps could perpetuate blind spots in the data, limiting the applicability of findings to truly marginalized populations. Moreover, biases in data collection persist despite efforts at inclusivity. For example, reliance on electronic health records (EHRs) introduces potential disparities, as not all participants have equal access to healthcare providers who maintain comprehensive records. Similarly, self-reported data, which forms a key component of the program, may reflect biases influenced by cultural perceptions or participant misunderstandings. Recognizing and mitigating these biases is crucial to maximizing the program’s impact.

10 Denny, J. C., et al. (2019). "The 'All of Us' Research Program." New England Journal of Medicine, 381(7), 668-676.

Implications for policy:

The findings from the All of Us Research Program have far-reaching implications for healthcare policy. By highlighting disparities in disease prevalence, access to care, and environmental exposures, the program provides evidence-based insights that policymakers can use to design targeted interventions. For instance, the program’s identification of higher air pollution exposure among racial minorities could support stricter regulations on industrial emissions in vulnerable communities. Similarly, the correlation between healthcare access and chronic conditions underscores the need for expanding Medicaid or implementing universal healthcare coverage. Legislative measures to protect genomic data privacy are also essential. Existing laws, such as the Genetic Information Nondiscrimination Act (GINA), provide a foundation but are insufficient for addressing the risks associated with large-scale genomic datasets. Policymakers should consider expanding protections to cover re-identification risks, particularly as advances in AI and data analytics make it easier to match deidentified data with other public information. Mandatory encryption standards, independent audits of genomic databases, and stronger penalties for data misuse could further enhance participant trust and safeguard sensitive information.11 Beyond privacy, the program’s findings should inspire proactive public health interventions. For example, community-based programs could leverage insights from All of Us to address disparities in physical activity among different income groups. Low-cost fitness initiatives tailored to low-income neighborhoods, coupled with

11 Denny, J. C., et al. (2019).

incentives for employers to promote workplace wellness, could help bridge gaps in physical activity and related health outcomes. Similarly, predictive tools developed from All of Us data could inform early screening programs for chronic diseases like diabetes and hypertension, particularly in underserved populations where these conditions are most prevalent. The program also emphasizes the importance of participatory policymaking. Engaging community leaders in the design and implementation of health policies ensures that solutions are culturally relevant and widely accepted. For example, tribal consultations within All of Us offer a model for involving Indigenous populations in decision-making, which could be replicated across other demographic groups. This participatory approach not only fosters trust but also ensures that policies are responsive to the needs of diverse communities. Finally, the longitudinal nature of the All of Us dataset offers unique opportunities for monitoring the effectiveness of policy interventions over time. Policymakers can use this data to evaluate the impact of environmental regulations, healthcare reforms, or public health campaigns, allowing for continuous improvement and adaptation. This feedback loop ensures that policies informed by All of Us findings remain dynamic and impactful, addressing the evolving challenges of public health and healthcare equity.

Conclusion:

The All of Us Research Program represents a monumental leap forward in the field of personalized medicine and healthcare equity. By prioritizing diversity and inclusivity, the program addresses long-standing gaps in genomic research, offering a clearer picture of how genetics, environment, and lifestyle shape

health outcomes. Its efforts not only pave the way for advancements in personalized treatments but also provide a critical foundation for addressing systemic health disparities that have persisted for generations. Moreover, the program’s emphasis on participant engagement fosters trust and transparency, setting a new standard for collaborative research. The potential for its findings to influence healthcare policies and funding allocations further underscores its transformative impact. The program’s potential to transform healthcare lies in its comprehensive and interdisciplinary approach. From enabling pharmacogenomic tools to refining early detection methods, All of Us demonstrates how diverse datasets can revolutionize how we diagnose, treat, and prevent disease. Equally significant are its contributions to public health policy, offering actionable insights into environmental exposures, healthcare access, and chronic disease prevalence. These findings have already begun to inform interventions that are more equitable, precise, and effective. However, to fully realize the transformative potential of All of Us, continued investment in inclusive and ethical genomic research is essential. Expanding privacy protections, refining recruitment strategies, and addressing biases in data collection must remain top priorities. This commitment will ensure that the benefits of genomic discoveries extend to all communities, particularly those historically excluded from such research. The All of Us Research Program exemplifies the intersection of science, ethics, and equity, offering a roadmap for future initiatives in precision medicine. Its success underscores the importance of diversity, transparency, and collaboration in building a healthcare system that truly works for everyone. By supporting and advancing programs like All of Us, we move closer

to a future where personalized medicine is not just a possibility but a reality for all.

References

1. Collins, F. S., & Varmus, H. (2015). "A New Initiative on Precision Medicine." New England Journal of Medicine, 372(9), 793-795.

https://www.nejm.org/doi/full/10.1056/NEJMp1500523

2. All of Us Research Program Investigators. (2019). "The 'All of Us' Research Program." New England Journal of Medicine, 381(7), 668-676.

https://www.nejm.org/doi/full/10.1056/NEJMsr1809937

3. Lander, E. S., & Collins, F. S. (2022). "The Next Phase of Human Genomics." New England Journal of Medicine, 386(20), 1922-1923.

https://www.nejm.org/doi/full/10.1056/NEJMp2205780

4. Kaye, J., et al. (2015). "Data Sharing in Genomics Reshaping Scientific Practice." Nature Reviews Genetics, 16(1), 7-15. https://www.nature.com/articles/nrg3863

5. Manolio, T. A., et al. (2015). "Implementing Genomic Medicine in the Clinic: The Future Is Here." Genetics in Medicine*, 17(7), 488-491.

https://www.nature.com/articles/gim2014106

6. Geisinger Health System. (2018). "MyCode Community Health Initiative."

https://www.geisinger.org/precision-health/mycode

7. UK Biobank Coordinating Centre. (2007). "UK Biobank: Protocol for a Large-Scale Prospective Epidemiological Resource." https://www.ukbiobank.ac.uk/media/gnkeyh2q/studyrationale.pdf

8. Denny, J. C., et al. (2019). "The 'All of Us' Research Program." New England Journal of Medicine, 381(7), 668-676. https://www.nejm.org/doi/full/10.1056/NEJMsr1809937

9. National Institutes of Health. (2020). "All of Us Research Program: Data Privacy and Security." https://allofus.nih.gov/about/program-overview/data-security-andprivacy

10. All of Us Research Program. “ National Institutes of Health.” U.S. Department of Health and Human Services. https://allofus.nih.gov/

Turn static files into dynamic content formats.

Create a flipbook
Issuu converts static files into: digital portfolios, online yearbooks, online catalogs, digital photo albums and more. Sign up and create your flipbook.