Southwest Regional Superfund Summit Aims to Strengthen Data Science Collaborations and Training

Aug. 23, 2023

UA SRC DMAC.jpg

outdoor photo of group of Superfund researchers
Participants of the Southwest Regional Superfund Summit at the University of New Mexico

The University of Arizona Superfund Research Center (UA SRC) joined two other Superfund Research Centers funded by the National Institute of Environmental Health Sciences (NIEHS) to share their accomplishments and challenges in fulfilling the data management and analysis mandate of the NIEHS Superfund Program.

Scientists from the University of New Mexico, Texas A&M University, and the University of Arizona met in Albuquerque for a day-long summit to kick off a regional collaborative of Data Management and Analysis Cores (DMACs). This partnership will be identifying opportunities to enhance data science, research translation, training, and community engagement while simultaneously creating efficiencies in how the Centers accomplish their goals in a challenging funding environment.

The UA SRC DMAC is a national leader in creating open science tools for “big data”. The university houses CyVerse, an open data platform with over a hundred thousand users across the United States and beyond that provides scientists with data storage, bioinformatics tools, data visualization, interactive analyses, cloud services, APIs, and more. Our Center leverages CyVerse to manage and analyze data from research projects that focus on exposures, health impacts, and mitigation of hazardous dusts in drylands.

“CyVerse supports the management and integration of data assets across our Center, including establishing, coordinating, and monitoring processes for data management and analysis and identifying opportunities for integrating project/core-generated data with other existing datasets,” said Xinxin Ding, Center Director and Coit Professor and Department Head at the University of Arizona College of Pharmacy. “Our DMAC team, including Nirav Merchant, CyVerse PI and the Director of UA Data Science Institute, is eager to introduce CyVerse to our regional partners and the rest of the Superfund Program.” 

The University of New Mexico Superfund Center DMAC, which hosted the summit, has unique expertise in managing and analyzing the data from a number of projects aimed at addressing environmental concerns of indigenous communities. Strengths include enabling data-intensive projects with highly sensitive and privacy-protected environmental and population cohort data, as well as using geospatial approaches to report-back the results to the partner communities and the regulatory agencies to facilitate reduction in the burden of environmental pollution and ultimately improve the health of humans, animals and the environment.

According to Dr. Matthew Campen at the University of New Mexico, “The complexity of risks faced by our Indigenous community partners has led us to develop novel approaches for analysis of mixtures that include psychosocial, dietary, and physiological risks that combine with contaminant mixture exposures. We work within potentially conflicting needs of data security and privacy conferred by Tribal data sovereignty juxtaposed with NIH needs of FAIR datasets that allow for combined analyses of data collected from many investigators. Models and methods developed to meet those needs can apply to many other communities where environmental injustices and disparities combine to create similar needs.”

Texas A&M University Superfund Center DMAC has implemented a rigorous system for quality assurance of research resources, data and results. In addition, the Core offers a number of unique high-dimensional analysis approaches that allow for exploration of the linkages between environmental exposure and adverse health effects data on both individual chemicals and mixtures.

“Our Center’s focus on disaster research response creates a number of challenges with data management and analysis,” said Ivan Rusyn, Center Director and University Professor at Texas A&M University School of Veterinary Medicine and Biomedical Sciences. “The analytical solutions, focus on rigorous quality assurance and training are contributing not only to the success of our own projects, but benefit the larger environmental science community and our community and government partners.”

While each Center has unique strengths in data management and analysis, the summit identified a number of areas where the Centers can work together to leverage their strengths and to promote rigorous, open and collaborative environmental health science.

Dr. Matthew Campen (University of New Mexico) further pointed out, “Many of our graduate students and postdoctoral fellows could benefit from the combined expertise and tools the three programs have to offer. The diversity of research focus and applied data science approaches can create a more complete experience for our trainees.”

The Centers agreed to create a Southwest Superfund Consortium in Data Science, a mutually beneficial partnership where each Center’s DMAC agreed to establish communities of practice in data management/data science that deliver cross-center training, provide expert consultations, and share approaches and tools. 

Short- to medium-term priorities for the Consortium will be in the areas of data privacy and security, quality assurance, geospatial data visualization and modeling, expanding the utilization of CyVerse in the environmental science and engineering projects, and training in open data science tools.

While ambitious, the Consortium anticipates that these activities will not only create cross-Center collaborations and efficiencies, but also ultimately result in enhanced solutions to environmental health issues that are plaguing a number of indigenous and environmental justice communities across the Southwestern United States.

As noted by the Texas A&M Center’s Deputy Director, Weihsueh Chiu, “While there are of course differences across Centers, such as the more rural focus of UA and UNM compared to the more urban focus of TAMU, we believe the cross-fertilization of data science approaches across our centers will create synergies that ‘lift all boats’ in addressing the critical environmental health issues facing these communities.”

Related Cores