September Announcement

 

Welcome to the September edition of the Data Science Initiative newsletter!

We’re excited to announce that the DSI has become an institutional member of the Academic Data Science Alliance (ADSA)! ADSA is a community of leaders, researchers, and educators who thoughtfully integrate data science and AI best practices in higher education. This membership opens up exciting new avenues for collaboration and growth within our data science community. If you would like to learn more about the ADSA, head to their site.

In other news, our recent Data Science Mixer for Recently Hired Faculty was a great success, helping to foster connections and introduce new faculty to the wealth of resources available at the university. Thank you to everyone who attended and contributed to this engaging event. See more below. 

Looking ahead, we have even more opportunities on the horizon, including the launch of our AI Makerspace Hours series, detailed information on WiADS, updates to the revamped Seed Grant program, and many events led not only by the DSI, but also by other data-focused centers and initiatives.

The DSI’s Goals and Supporting Programs


Featured Article/Celebrating Success 

Introducing Our New DSI Core Member! - Jie Ding Joins the Data Science Institute as a Core Member

We are excited to welcome Jie Ding as the newest Core Member of the Data Science Institute (DSI). An Associate Professor in the School of Statistics at the University of Minnesota, Jie’s research focuses on statistical learning, algorithmic decision-making, and artificial intelligence—areas that closely align with DSI's mission.

As a Core Member, Jie will contribute to shaping DSI’s strategy and programs, furthering our commitment to advancing data science through interdisciplinary collaboration. Core Members like Jie are essential to DSI, actively engaging in research that drives the institute’s vision forward.

Learn more about our Core Members and their roles within the DSI community. We look forward to the innovative contributions our Core Members will bring to the team. Please join us in welcoming him to the Data Science Initiative!

DSI Hosts Mixer for Recently Hired Faculty

The Data Science Initiative recently held a mixer aimed at connecting new faculty with the university's data science community. Attendees had the opportunity to network with colleagues, learn about key DSI resources such as the International Institute for Biosensing (IIB) and the Minnesota Supercomputing Institute (MSI), and explore funding opportunities like the seed grant program. A presentation showcasing these resources was shared to highlight ways to engage in ongoing research initiatives.

We invite faculty and staff to stay informed by checking the DSI event calendar regularly and invite you to read the full article on the DSI site, where they can view the presentation and explore images from the successful event.

Read the full article and see the presentation here.

 

Looking back to a DSI Data Voyages to hail the Honeycrisp!

On January 25th, the Data Science Initiative (DSI) hosted a seminar titled “The Science of Apple Cultivation,” offering an in-depth exploration of the processes behind apple breeding and innovation. If you missed it, you can access the recording on the DSI site. This event shed light on the University of Minnesota’s contributions to apple science, including the development of the beloved Honeycrisp.

According to a recent CFANS article titled “All hail the Honeycrisp!, 52 percent of Minnesotans claim the Honeycrisp as their favorite apple. The U of M’s apple breeding program has introduced 29 varieties to date, with Honeycrisp reigning supreme. The survey also found that 80 percent of Minnesotans prioritize purchasing locally grown fruits, showing their strong support for Minnesota agriculture.


Initiative Updates

Announcing New DSI Seed Grants!

The Data Science Initiative is excited to launch new seed grants to support innovative research in data science. These grants focus on key areas such as Robotics, Global Food, Environment, Brain Conditions, and Cancer Clinical Trials. Opportunities include Rapid Response Grants, DSI Faculty Fellowships, and Data Sets Grants, each offering significant funding and support. Research staff and faculty across all disciplines are encouraged to apply.  

For more details and application information, visit the Seed Grant Page on the DSI site 

 

SAIL 2024

The planning committee for SAIL 2024 is seeking panelists for the main conference sessions on October 8-9. This is an excellent opportunity to share your expertise, represent our Institute, and contribute to meaningful discussions shaping the future of AI research and leadership. 

Topics of interest to this group may include Institute Sustainability, Developing Educational Opportunities in AI Research, Overcoming AI Development Challenges, Foundational AI Research, BPC & AI, and Generative AI.

If you're interested in participating as a panelist, or if you'd like to recommend a colleague, please complete the Panelist Request Form for SAIL 2024. The form can be accessed here. AIVO is requesting submissions by early next week, if possible

Key Details:

  • Event: Summit for AI Institutes Leadership (SAIL) 2024

  • Dates: October 7-10, 2024

  • Location: Pittsburgh, PA

  • Panelist Opportunities: October 8-9, 2024

If you have any questions about the event, please don't hesitate to reach out to Shawn Haag at  [email protected].


 

Research Spotlight - Seed Grant Awardee

 

Title: Determination of Functional Drivers of Lewy Body Disease among the Intestinal Microbiota

PI(s): Levi Teigen, Christopher Staley

DSI Track: Digital Health

MnDRIVE Area(s): Brain Conditions

Summary Paragraph:

Lewy Body Disease (LBD) is the second most common neurodegenerative disorder after Alzheimer's Disease. Pathologically, LBD is characterized by  immunopositive α-synuclein aggregations in the form of Lewy bodies. Given this context, improving our understanding of what triggers α-synuclein aggregation, and promotes disease progression, is crucial. Gastrointestinal concerns (e.g. constipation) are an independent early clinical pre-motor/cognitive dysfunction feature of LBD, which places the gut and thus microbiome at the center of Lewy body research. In collaboration with neurology colleagues at Mayo Clinic, we identified a gut dysbiosis characterized by decreased short-chain fatty acid producing taxa in patients with LBD. Our central hypothesis is that dysbiosis underlying LBD results in microbial ecosystem services favoring the accumulation and propagation of abnormal α-synuclein. We will perform whole genome shotgun sequencing of DNA we previously analyzed by 16S rRNA amplicon sequencing to 1) evaluate differences in functional pathways between LBD patients, cohabitant controls, and healthy controls; and 2) develop a Bayesian model of directional interactions to build a model of disease development (i.e. predictive index). Our proposal promises novel insights into microbiome-targeted therapy for early-stage LBD and could mark a major advancement in the treatment of this devastating neurodegenerative disorder.

 

 

View 2023 Seed Grant Awardees

 


Events

 

AI Makerspace Hours

When: Every other Friday starting September 13, 2024

Where: Walter 575

The DSI and MSI invite all students, staff, and faculty to our AI Makerspace Hours, a unique event where you can dive into AI on our state of the art HPC with hands-on experience. With the support of our expert MSI staff, you'll learn everything from basic coding to training advanced generative AI models. Enjoy access to dedicated HPC nodes for practical learning and a set of comprehensive tutorials.

RSVPs are not required but highly recommended; otherwise, attendees will need to spend a few minutes creating an account on the HPC. Please bring your own laptop (it doesn't need to be a high-performance one). There will be one or two laptops available to loan out if needed. 

Don’t miss out on this opportunity to learn, explore, and innovate with us! RSVP Now!

 

WiADS

When: November 4th, 2024

Where: McNamara Alumni Center in Minneapolis

Mark your calendars for the Women in AI & Data Science Conference (WiADS), happening on November 4th, 2024, at the McNamara Alumni Center in Minneapolis. This free event is a unique opportunity to connect with leaders, experts, and enthusiasts in data science and AI, with a special emphasis on supporting women and non-binary individuals.

The conference will feature a diverse range of activities, including technical sessions, business-focused applications, engaging panel discussions, and ample networking opportunities. Whether you’re a seasoned professional or new to the field, WiADS offers something for everyone.

Don’t miss your chance to participate—the call for abstracts is still open! Submit your information todaySubmit your abstract today and be part of this inspiring event!

 

Distinguished Guest Seminar - Jeff Leek

When: October 21st, 2024 from 

Where: McNamara Alumni Center in Minneapolis

The DSI Distinguished Guest Seminar on October 21nd will feature Dr. Jeff Leek, Vice President and Chief Data Officer at Fred Hutch. Dr. Leek is a leading expert in biostatistics and data science, with significant contributions to research and data-driven strategies in public health. The seminar will be available for both in-person and remote viewing, providing an opportunity to gain valuable insights from one of the field's foremost authorities.

 

Data Discovery Across Departments 

Events in other departments/initiatives/institutions - (External (Non-DSI Events)

 

Minnesota Demography & Aging Seminar Series

All Seminars will be held in-person on Mondays from 12:15-1:15 PM CT in the Seminar Room in 50 Willey Hall. Snacks and beverages will be served. All Seminars will also be live streamed via Zoom.

For the most up-to-date information, view the full schedule on either the LLC website or the MPC website.

Fall 2024 Schedule 

  • SEPTEMBER 23

    A SHiPP ‘In the Offing’: Delivering an Early Analysis of a c19th and c20th Scottish Registers-based Database

    Peter Christen, SHiPP Research Lead, School of Geosciences at the University of Edinburgh and Chris Dibben, Chair in Geography, School of Geosciences at the University of Edinburgh 

  • OCTOBER 28

    Poor Quality Race/Ethnicity Data is Systemic Racism: Challenges and Solutions

    Stella Yi - Associate Professor, Department of Population Health, New York University

  • NOVEMBER 11

    Effect of System Affiliation on Hospital Choice: Evidence from Rural Markets

    Caitlin Carroll - Assistant Professor, Division of Health Policy & Management, University of Minnesota School of Public Health

  • DECEMBER 2

    Granular Income Inequality and Mobility using IDDA: Exploring Patterns across Race and Ethnicity

    Abigail Wozniak - Vice President at the Federal Reserve Bank of Minneapolis, Director of the Opportunity & Inclusive Growth Institut

     

Tools for organizing PDFs and article citations

When: September 9, 2024 | 1 - 2:30pm

Citation managers are tools to help you collect, organize, cite, and share research. This session is the second in a series of 5 and  will give an overview of various citation managers and then a deep dive into Zotero, a free, easy to use citation manager. We will also demonstrate how to format citations in a variety of styles and how to add in-text citations to Microsoft Word and Google Docs, as well as share citations with others using the group feature.

Join UMN Libraries this fall for a series of workshops to build and enhance your data management strategies. Topics will range from introductory skills to specific tool-based workshops on citation managers, backup workflows, the future of data sharing, and publishing your data. Sign up for a single workshop or join for the entire series. There is an opportunity to earn a Foundations of Data Management Badge by attending at least 3 out of 5 of the workshops below and completing assignments in the associated asynchronous Data Management Canvas course. 

Learn more about each workshop and register at z.umn.edu/dmbootcamp. Free to U of M graduate students!

Hosted by Research Data Services, a collaboration of University Libraries and LATIS.

 

2024 Advances in Learning Health System Sciences Conference

When: September 9–10, 2024

Where: Coffman Memorial Union

Discover how real-world data, health data science, and resultant real-world evidence can impact the future of healthcare. Bringing together interest holders of learning health systems throughout the state of Minnesota and beyond and supported by the University of Minnesota Data Science Initiative, the Clinical Translational Science Institute, and the Office of Academic Clinical Affairs,  the 2024 Advances in Learning Health System Sciences Conference is hosted by the Center for Learning Health System Sciences, a joint partnership of the University of Minnesota Medical School and School of Public Health. Engage with experts in healthcare policy, healthcare operations and innovation, applied data science, evidence-based care, and implementation science through thought-provoking sessions, workshops, and networking opportunities.

For more information and registration, visit the 
conference website.

Register ends August 16. Register here to secure your spot!

 

28th Annual IEEE High Performance Extreme Computing Virtual Conference

When: September 23rd to 27th, 2024

Where: Virtual

The 28th Annual IEEE High Performance Extreme Computing Virtual Conference is happening from September 23rd to 27th, 2024. They are seeking submissions showcasing advancements in High-Performance Computing, AI/Machine Learning, and more. 

Submission deadline: July 7, 2024. Don't miss this chance to contribute to cutting-edge computing technology! 

For details, visit the conference website.

(Learn more about IEEE)


Learning Resources

 

Libraries: Data Management Good Practices - Tools for organizing PDFs and article citations

WhenSeptember 9th 2024, 1:00 - 2:30pm

Citation managers are tools to help you collect, organize, cite, and share research. This session will give an overview of various citation managers and then a deep dive into Zotero, a free, easy to use citation manager. We will also demonstrate how to format citations in a variety of styles and how to add in-text citations to Microsoft Word and Google Docs, as well as share citations with others using the group feature.  

Register for Sept 9

 

Libraries: Data Management Good Practices - Storage, back-up, and versioning, oh my!

WhenOctober 2nd 2024, 1:00 - 2:30pm

Going beyond the best practices of "3-2-1", this session will demonstrate real workflows and tools for backing up and versioning your research data. We will cover a variety of techniques for protecting your data, from low/no tech to more technical coding based tools, such as GitHub. This session will also address recent Google Drive changes. No matter what your discipline or what materials you work with, this workshop will assist you in finding a storage strategy that is right for you!

Register for Oct 2

 

Libraries: Data Management Good Practices - The Future of Data Sharing: What You Need to Know About Federal Mandates

WhenOctober 15th 2024, 10:00 - 11:30 am

Recently, the Office of Science and Technology Policy (OSTP) released a memo requiring that all federal funding agencies adapt a data sharing policy for their grantees. The National Institutes of Health (NIH) is one agency that has already implemented this requirement, and all federal funders will do so by 2025. What does this mean for data management practices during the lifecycle of your research project? What must you write into a grant application? How do you know where to share your data? All of these topics and more will be explored in this session.

Register for Oct 15

 

Why Data Science?

Data Informs Decision-Making and Drives Innovation Data science is the study of data to extract meaningful insights.

Data Science is a multidisciplinary approach that combines principles and practices from the fields of mathematics, statistics, artificial intelligence, and computer engineering to analyze large amounts of information. GEMS Learning courses are modular data science education tailored to food, agriculture, and natural resource applications for working professionals and students. Across the curriculum, instructors have built their course content from their own work executing large-scale data science projects to solve pressing agricultural problems.

Fall 2024 Courses

  • NEW High Performance Computing for Agriculture (beta course for UMN affiliates only)

    If you are a researcher that works in the Agri-food domain (e.g., breeder, molecular biologist, food scientist, socioeconomist), you know a little bit of programming (e.g., in R and/or Python), but you feel a little limited (e.g., some of your calculations run for days on your laptop), then you could benefit from this course. We wish to show you how to step up to the next level, improve your coding efficiency, and make use of High Performance Computing (HPC) and Cloud resources readily available to you.

  • Breaking the Compute Barrier, Upskilling Agri-Food Researchers to Utilize HPC Resources, September 16 - November 18

 

  • Computing Basics for the Agri-food Sector (self paced)

    Are you a field or bench scientist and always wanted to feel more comfortable with your computing skills? These self-paced online courses are designed for those who have never used the command line, but realize that the responsibilities they have or will soon take on require them to automate tasks. Learn basic UNIX command-line skills, enable participants to work remotely on more powerful machines, create and run scripts to automate complex workflows, and synchronize your scripts with the larger community with Github.

  1. Introducing the GEMS platform  + Jupyter Lab
  2. Demystifying the UNIX command line
  3. Working Remotely and Scheduling Jobs on MSI’s systems 
  4. Synching your work with the community

 

  • Introduction to Data Analysis with R

    Two hour introductory workshop for those who are new to R and are interested in learning the basics of using R for data analysis

    Introduction to Data Analysis with R, October 30

  • Analyzing Spatial Agriculture data in R 

    Is accounting for spatial dependency in your analyses critical to your work?

    Or do you need to create a continuous surface of data (i.e., raster) based on a sample point date taken at selected locations?  

    Introduction to spatial data analysis in R, November 6, 13 and 20th

  • Explicitly Accounting for Location in Agriculture in Python (self paced)

    Learn how to work with spatial data in Python, starting from importing different spatial datasets and creating simple maps, to conducting basic geocomputation on vector and raster data. 

    Introduction to spatial data analysis in Python

    Spatial Regression in Python

 

Join the new Data Management Network! 

What is this? We are starting a network to connect research staff who work on data management in individual labs, departments, and units. This will help facilitate the sharing of resources and knowledge and help data managers link to broader technology resources across the university.

Who should join?: Individuals who engage with research data management within a lab, center, department, college or central research office. 

How to join? Fill out our interest form (https://z.umn.edu/join-dmn) to be added to the kick off event on September 10, 2024 as well as our slack and email groups. 

What do we mean by "Data Management"? The broad definition this group will take is data management as the "collection of practices, concepts, and processes that help maintain data integrity, quality, security, and usability throughout the data lifecycle. Data management best practices, policies, and tools help us apply the appropriately steward data from the moment it is created through the last day it is retained (https://data.wisc.edu/data-literacy/manage/)." Importantly, we want this group to be centered on individuals who manage research data in the scope of individual research projects, studies, groups, or centers and are focused on the stewardship of data to advance scholarship and knowledge. We anticipate most individuals will be engaging in broad data sharing as part of their stewardship of data. This includes both open and restricted data sharing. 

Questions? Please reach out to Shannon Farrell ([email protected]or Alicia Hofelich Mohr ([email protected]) with any questions. 


 

Funding Opportunities and Deadline

If you are interested in any of these or other data science related opportunities and need help organizing your submission or finding the right team please contact us, we’re here to help!

 

  • NSF MFAIMathematical Foundations in AI Deeper mathematical understanding is essential to ensuring that AI can be harnessed to meet the future needs of society and enable broad scientific discovery, while forestalling the unintended consequences of a disruptive technology. The overall goal is to establish innovative and principled design and analysis approaches for AI technology using creative yet theoretically grounded mathematical and statistical frameworks, yielding explainable and interpretable models that can enable sustainable, socially responsible, and trustworthy AI. Full proposal due: Oct. 10th, 2024
  • NIH AI and Patient SafetyThe NIH is interested in supporting healthcare safety by determining (1) whether and how certain breakthrough uses of Artificial Intelligence (AI) systems can affect patient safety; and (2) how AI systems can be safely implemented and used. AI has the potential to improve the safety, effectiveness, efficiency, accessibility, and affordability of healthcare. However, as with most technologies, this potential must be balanced by identifying and mitigating potential risks for patient harm and user burden. Sept. 25, 2024, next due date Jan. 25, 2025

 

Upcoming deadlines:

  • NEH: The Dangers and Opportunities of Technology: Perspectives from the Humanities. Due Sept. 12, 2024

 

For students:

  • The NSF PACK fellowship: The PACK fellowship is a graduate student opportunity to conduct research at the University of Kiel, Germany for 3 weeks. Applicants from any science or engineering discipline are encouraged to apply now! 

 


Open Positions and Assistantships

 


Social Media/Website Links

(LinkedIn) (Instagram) (YouTube)

Catchup on the Latest News at DSI

WiADS 2024 Conference: A Day of Inspiration and Connection for Women in Data Science

On November 4th, 2024, the University of Minnesota hosted the highly anticipated Women in AI and Data Science (WiADS) Conference, organized in partnership with MinneAnalytics at the McNamara Alumni Center. This sold-out event brought together over 1,000 registrants, including over 550 in-person attendees, and showcased the work of women, non-binary, and gender-diverse voices in data science.