September Edition
September Announcement
Welcome to the September edition of the Data Science Initiative newsletter!
We’re excited to announce that the DSI has become an institutional member of the Academic Data Science Alliance (ADSA)! ADSA is a community of leaders, researchers, and educators who thoughtfully integrate data science and AI best practices in higher education. This membership opens up exciting new avenues for collaboration and growth within our data science community. If you would like to learn more about the ADSA, head to their site.
In other news, our recent Data Science Mixer for Recently Hired Faculty was a great success, helping to foster connections and introduce new faculty to the wealth of resources available at the university. Thank you to everyone who attended and contributed to this engaging event. See more below.
Looking ahead, we have even more opportunities on the horizon, including the launch of our AI Makerspace Hours series, detailed information on WiADS, updates to the revamped Seed Grant program, and many events led not only by the DSI, but also by other data-focused centers and initiatives.
The DSI’s Goals and Supporting Programs
Featured Article/Celebrating Success
Introducing Our New DSI Core Member! - Jie Ding Joins the Data Science Institute as a Core Member
We are excited to welcome Jie Ding as the newest Core Member of the Data Science Institute (DSI). An Associate Professor in the School of Statistics at the University of Minnesota, Jie’s research focuses on statistical learning, algorithmic decision-making, and artificial intelligence—areas that closely align with DSI's mission.
As a Core Member, Jie will contribute to shaping DSI’s strategy and programs, furthering our commitment to advancing data science through interdisciplinary collaboration. Core Members like Jie are essential to DSI, actively engaging in research that drives the institute’s vision forward.
Learn more about our Core Members and their roles within the DSI community. We look forward to the innovative contributions our Core Members will bring to the team. Please join us in welcoming him to the Data Science Initiative!
DSI Hosts Mixer for Recently Hired Faculty
The Data Science Initiative recently held a mixer aimed at connecting new faculty with the university's data science community. Attendees had the opportunity to network with colleagues, learn about key DSI resources such as the International Institute for Biosensing (IIB) and the Minnesota Supercomputing Institute (MSI), and explore funding opportunities like the seed grant program. A presentation showcasing these resources was shared to highlight ways to engage in ongoing research initiatives.
We invite faculty and staff to stay informed by checking the DSI event calendar regularly and invite you to read the full article on the DSI site, where they can view the presentation and explore images from the successful event.
Read the full article and see the presentation here.
Looking back to a DSI Data Voyages to hail the Honeycrisp!
On January 25th, the Data Science Initiative (DSI) hosted a seminar titled “The Science of Apple Cultivation,” offering an in-depth exploration of the processes behind apple breeding and innovation. If you missed it, you can access the recording on the DSI site. This event shed light on the University of Minnesota’s contributions to apple science, including the development of the beloved Honeycrisp.
According to a recent CFANS article titled “All hail the Honeycrisp!”, 52 percent of Minnesotans claim the Honeycrisp as their favorite apple. The U of M’s apple breeding program has introduced 29 varieties to date, with Honeycrisp reigning supreme. The survey also found that 80 percent of Minnesotans prioritize purchasing locally grown fruits, showing their strong support for Minnesota agriculture.
Initiative Updates
Announcing New DSI Seed Grants!
The Data Science Initiative is excited to launch new seed grants to support innovative research in data science. These grants focus on key areas such as Robotics, Global Food, Environment, Brain Conditions, and Cancer Clinical Trials. Opportunities include Rapid Response Grants, DSI Faculty Fellowships, and Data Sets Grants, each offering significant funding and support. Research staff and faculty across all disciplines are encouraged to apply.
For more details and application information, visit the Seed Grant Page on the DSI site
SAIL 2024
The planning committee for SAIL 2024 is seeking panelists for the main conference sessions on October 8-9. This is an excellent opportunity to share your expertise, represent our Institute, and contribute to meaningful discussions shaping the future of AI research and leadership.
Topics of interest to this group may include Institute Sustainability, Developing Educational Opportunities in AI Research, Overcoming AI Development Challenges, Foundational AI Research, BPC & AI, and Generative AI.
If you're interested in participating as a panelist, or if you'd like to recommend a colleague, please complete the Panelist Request Form for SAIL 2024. The form can be accessed here. AIVO is requesting submissions by early next week, if possible.
Key Details:
Event: Summit for AI Institutes Leadership (SAIL) 2024
Dates: October 7-10, 2024
Location: Pittsburgh, PA
Panelist Opportunities: October 8-9, 2024
If you have any questions about the event, please don't hesitate to reach out to Shawn Haag at [email protected].
Research Spotlight - Seed Grant Awardee
Title: Determination of Functional Drivers of Lewy Body Disease among the Intestinal Microbiota PI(s): Levi Teigen, Christopher Staley DSI Track: Digital Health MnDRIVE Area(s): Brain Conditions Summary Paragraph: Lewy Body Disease (LBD) is the second most common neurodegenerative disorder after Alzheimer's Disease. Pathologically, LBD is characterized by immunopositive α-synuclein aggregations in the form of Lewy bodies. Given this context, improving our understanding of what triggers α-synuclein aggregation, and promotes disease progression, is crucial. Gastrointestinal concerns (e.g. constipation) are an independent early clinical pre-motor/cognitive dysfunction feature of LBD, which places the gut and thus microbiome at the center of Lewy body research. In collaboration with neurology colleagues at Mayo Clinic, we identified a gut dysbiosis characterized by decreased short-chain fatty acid producing taxa in patients with LBD. Our central hypothesis is that dysbiosis underlying LBD results in microbial ecosystem services favoring the accumulation and propagation of abnormal α-synuclein. We will perform whole genome shotgun sequencing of DNA we previously analyzed by 16S rRNA amplicon sequencing to 1) evaluate differences in functional pathways between LBD patients, cohabitant controls, and healthy controls; and 2) develop a Bayesian model of directional interactions to build a model of disease development (i.e. predictive index). Our proposal promises novel insights into microbiome-targeted therapy for early-stage LBD and could mark a major advancement in the treatment of this devastating neurodegenerative disorder.
|
Events
AI Makerspace Hours
When: Every other Friday starting September 13, 2024
Where: Walter 575
The DSI and MSI invite all students, staff, and faculty to our AI Makerspace Hours, a unique event where you can dive into AI on our state of the art HPC with hands-on experience. With the support of our expert MSI staff, you'll learn everything from basic coding to training advanced generative AI models. Enjoy access to dedicated HPC nodes for practical learning and a set of comprehensive tutorials.
RSVPs are not required but highly recommended; otherwise, attendees will need to spend a few minutes creating an account on the HPC. Please bring your own laptop (it doesn't need to be a high-performance one). There will be one or two laptops available to loan out if needed.
Don’t miss out on this opportunity to learn, explore, and innovate with us! RSVP Now!
WiADS
When: November 4th, 2024
Where: McNamara Alumni Center in Minneapolis
Mark your calendars for the Women in AI & Data Science Conference (WiADS), happening on November 4th, 2024, at the McNamara Alumni Center in Minneapolis. This free event is a unique opportunity to connect with leaders, experts, and enthusiasts in data science and AI, with a special emphasis on supporting women and non-binary individuals.
The conference will feature a diverse range of activities, including technical sessions, business-focused applications, engaging panel discussions, and ample networking opportunities. Whether you’re a seasoned professional or new to the field, WiADS offers something for everyone.
Don’t miss your chance to participate—the call for abstracts is still open! Submit your information today. Submit your abstract today and be part of this inspiring event!
Distinguished Guest Seminar - Jeff Leek
When: October 21st, 2024 from
Where: McNamara Alumni Center in Minneapolis
The DSI Distinguished Guest Seminar on October 21nd will feature Dr. Jeff Leek, Vice President and Chief Data Officer at Fred Hutch. Dr. Leek is a leading expert in biostatistics and data science, with significant contributions to research and data-driven strategies in public health. The seminar will be available for both in-person and remote viewing, providing an opportunity to gain valuable insights from one of the field's foremost authorities.
Data Discovery Across Departments
Events in other departments/initiatives/institutions - (External (Non-DSI Events)
Minnesota Demography & Aging Seminar Series
All Seminars will be held in-person on Mondays from 12:15-1:15 PM CT in the Seminar Room in 50 Willey Hall. Snacks and beverages will be served. All Seminars will also be live streamed via Zoom.
For the most up-to-date information, view the full schedule on either the LLC website or the MPC website.
Fall 2024 Schedule
SEPTEMBER 23
A SHiPP ‘In the Offing’: Delivering an Early Analysis of a c19th and c20th Scottish Registers-based Database
Peter Christen, SHiPP Research Lead, School of Geosciences at the University of Edinburgh and Chris Dibben, Chair in Geography, School of Geosciences at the University of EdinburghOCTOBER 28
Poor Quality Race/Ethnicity Data is Systemic Racism: Challenges and Solutions
Stella Yi - Associate Professor, Department of Population Health, New York UniversityNOVEMBER 11
Effect of System Affiliation on Hospital Choice: Evidence from Rural Markets
Caitlin Carroll - Assistant Professor, Division of Health Policy & Management, University of Minnesota School of Public HealthDECEMBER 2
Granular Income Inequality and Mobility using IDDA: Exploring Patterns across Race and Ethnicity
Abigail Wozniak - Vice President at the Federal Reserve Bank of Minneapolis, Director of the Opportunity & Inclusive Growth Institut
Tools for organizing PDFs and article citations
When: September 9, 2024 | 1 - 2:30pm
Citation managers are tools to help you collect, organize, cite, and share research. This session is the second in a series of 5 and will give an overview of various citation managers and then a deep dive into Zotero, a free, easy to use citation manager. We will also demonstrate how to format citations in a variety of styles and how to add in-text citations to Microsoft Word and Google Docs, as well as share citations with others using the group feature.
Join UMN Libraries this fall for a series of workshops to build and enhance your data management strategies. Topics will range from introductory skills to specific tool-based workshops on citation managers, backup workflows, the future of data sharing, and publishing your data. Sign up for a single workshop or join for the entire series. There is an opportunity to earn a Foundations of Data Management Badge by attending at least 3 out of 5 of the workshops below and completing assignments in the associated asynchronous Data Management Canvas course.
Learn more about each workshop and register at z.umn.edu/dmbootcamp. Free to U of M graduate students!
Hosted by Research Data Services, a collaboration of University Libraries and LATIS.
2024 Advances in Learning Health System Sciences Conference
When: September 9–10, 2024
Where: Coffman Memorial Union
Discover how real-world data, health data science, and resultant real-world evidence can impact the future of healthcare. Bringing together interest holders of learning health systems throughout the state of Minnesota and beyond and supported by the University of Minnesota Data Science Initiative, the Clinical Translational Science Institute, and the Office of Academic Clinical Affairs, the 2024 Advances in Learning Health System Sciences Conference is hosted by the Center for Learning Health System Sciences, a joint partnership of the University of Minnesota Medical School and School of Public Health. Engage with experts in healthcare policy, healthcare operations and innovation, applied data science, evidence-based care, and implementation science through thought-provoking sessions, workshops, and networking opportunities.
For more information and registration, visit the conference website.
Register ends August 16. Register here to secure your spot!
28th Annual IEEE High Performance Extreme Computing Virtual Conference
When: September 23rd to 27th, 2024
Where: Virtual
The 28th Annual IEEE High Performance Extreme Computing Virtual Conference is happening from September 23rd to 27th, 2024. They are seeking submissions showcasing advancements in High-Performance Computing, AI/Machine Learning, and more.
Submission deadline: July 7, 2024. Don't miss this chance to contribute to cutting-edge computing technology!
For details, visit the conference website.
Learning Resources
Libraries: Data Management Good Practices - Tools for organizing PDFs and article citations
When: September 9th 2024, 1:00 - 2:30pm
Citation managers are tools to help you collect, organize, cite, and share research. This session will give an overview of various citation managers and then a deep dive into Zotero, a free, easy to use citation manager. We will also demonstrate how to format citations in a variety of styles and how to add in-text citations to Microsoft Word and Google Docs, as well as share citations with others using the group feature.
Libraries: Data Management Good Practices - Storage, back-up, and versioning, oh my!
When: October 2nd 2024, 1:00 - 2:30pm
Going beyond the best practices of "3-2-1", this session will demonstrate real workflows and tools for backing up and versioning your research data. We will cover a variety of techniques for protecting your data, from low/no tech to more technical coding based tools, such as GitHub. This session will also address recent Google Drive changes. No matter what your discipline or what materials you work with, this workshop will assist you in finding a storage strategy that is right for you!
Libraries: Data Management Good Practices - The Future of Data Sharing: What You Need to Know About Federal Mandates
When: October 15th 2024, 10:00 - 11:30 am
Recently, the Office of Science and Technology Policy (OSTP) released a memo requiring that all federal funding agencies adapt a data sharing policy for their grantees. The National Institutes of Health (NIH) is one agency that has already implemented this requirement, and all federal funders will do so by 2025. What does this mean for data management practices during the lifecycle of your research project? What must you write into a grant application? How do you know where to share your data? All of these topics and more will be explored in this session.
Why Data Science?
Data Informs Decision-Making and Drives Innovation Data science is the study of data to extract meaningful insights.
Data Science is a multidisciplinary approach that combines principles and practices from the fields of mathematics, statistics, artificial intelligence, and computer engineering to analyze large amounts of information. GEMS Learning courses are modular data science education tailored to food, agriculture, and natural resource applications for working professionals and students. Across the curriculum, instructors have built their course content from their own work executing large-scale data science projects to solve pressing agricultural problems.
Fall 2024 Courses
NEW High Performance Computing for Agriculture (beta course for UMN affiliates only)
If you are a researcher that works in the Agri-food domain (e.g., breeder, molecular biologist, food scientist, socioeconomist), you know a little bit of programming (e.g., in R and/or Python), but you feel a little limited (e.g., some of your calculations run for days on your laptop), then you could benefit from this course. We wish to show you how to step up to the next level, improve your coding efficiency, and make use of High Performance Computing (HPC) and Cloud resources readily available to you.
- Breaking the Compute Barrier, Upskilling Agri-Food Researchers to Utilize HPC Resources, September 16 - November 18
Computing Basics for the Agri-food Sector (self paced)
Are you a field or bench scientist and always wanted to feel more comfortable with your computing skills? These self-paced online courses are designed for those who have never used the command line, but realize that the responsibilities they have or will soon take on require them to automate tasks. Learn basic UNIX command-line skills, enable participants to work remotely on more powerful machines, create and run scripts to automate complex workflows, and synchronize your scripts with the larger community with Github.
- Introducing the GEMS platform + Jupyter Lab
- Demystifying the UNIX command line
- Working Remotely and Scheduling Jobs on MSI’s systems
- Synching your work with the community
Introduction to Data Analysis with R
Two hour introductory workshop for those who are new to R and are interested in learning the basics of using R for data analysis
Introduction to Data Analysis with R, October 30
Analyzing Spatial Agriculture data in R
Is accounting for spatial dependency in your analyses critical to your work?
Or do you need to create a continuous surface of data (i.e., raster) based on a sample point date taken at selected locations?
Introduction to spatial data analysis in R, November 6, 13 and 20th
Explicitly Accounting for Location in Agriculture in Python (self paced)
Learn how to work with spatial data in Python, starting from importing different spatial datasets and creating simple maps, to conducting basic geocomputation on vector and raster data.
Join the new Data Management Network!
What is this? We are starting a network to connect research staff who work on data management in individual labs, departments, and units. This will help facilitate the sharing of resources and knowledge and help data managers link to broader technology resources across the university.
Who should join?: Individuals who engage with research data management within a lab, center, department, college or central research office.
How to join? Fill out our interest form (https://z.umn.edu/join-dmn) to be added to the kick off event on September 10, 2024 as well as our slack and email groups.
What do we mean by "Data Management"? The broad definition this group will take is data management as the "collection of practices, concepts, and processes that help maintain data integrity, quality, security, and usability throughout the data lifecycle. Data management best practices, policies, and tools help us apply the appropriately steward data from the moment it is created through the last day it is retained (https://data.wisc.edu/data-literacy/manage/)." Importantly, we want this group to be centered on individuals who manage research data in the scope of individual research projects, studies, groups, or centers and are focused on the stewardship of data to advance scholarship and knowledge. We anticipate most individuals will be engaging in broad data sharing as part of their stewardship of data. This includes both open and restricted data sharing.
Questions? Please reach out to Shannon Farrell ([email protected]) or Alicia Hofelich Mohr ([email protected]) with any questions.
Funding Opportunities and DeadlineIf you are interested in any of these or other data science related opportunities and need help organizing your submission or finding the right team please contact us, we’re here to help!
Upcoming deadlines:
For students:
|
Open Positions and Assistantships
Social Media/Website Links
Catchup on the Latest News at DSI
Congratulations to Dr. Genevieve Melton-Meaux on Her Election to the National Academy of Medicine
WiADS 2024 Conference: A Day of Inspiration and Connection for Women in Data Science
On November 4th, 2024, the University of Minnesota hosted the highly anticipated Women in AI and Data Science (WiADS) Conference, organized in partnership with MinneAnalytics at the McNamara Alumni Center. This sold-out event brought together over 1,000 registrants, including over 550 in-person attendees, and showcased the work of women, non-binary, and gender-diverse voices in data science.