Non-Research Projects (Virtual)
ABLE Creative Program Assistant
Ange Mason, Education Program Manager, San Diego Supercomputer Center, UCSD
If you are interested in computer science/engineering and enjoys event planning, then this internship is just for you! Working with the program manager, you will plan the ABLE program and implement a recruitment strategy for the coming year. Your work will include building an event calendar full of speaker conferences, campus lab tours, community activities, hands-on-workshops, and department lectures appropriate for this program. Additionally, you will plan for a meet up events for the ABLE participants. This internship will give you the rare opportunity to meet and network with campus faculties and community leaders as a high school student. It may be challenging at times when you have to plan multiple events simultaneously, but this will build your organizational skills. Apply today to plan some new, creative, and exciting events for the UCSD ABLE program!
Number of Students Requested: 1–2
Number of hours per week: 10–15 hours. (There is a possibility that this internship could be extended through December 31, 2023)
Plan to Integrate Student into Group Activity:
The student will be a part of the summer student staff working alongside students involved in our other two summer REHS projects. Group meetings will be held twice a week either in person or over Zoom. The goal of ABLE summer programs is to enhance our current educational content with research your videos, enhanced web design and marketing and new planned events.
Student Prerequisite:
We are looking for a student who is self-driven, able to work in independent/group settings, and enjoy collaborating with a diverse population. Excellent written and oral skills are a must. Leadership experiences and WordPress skills will be a plus, but not required. Please display your required organizational skills in your application by providing examples of events that you have planned in your essay/interview. If you do not have planning experience, please research the ABLE program and outline events that you think could enhance this program.
Relevant Links:
- San Diego Supercomputer Center (SDSC): http://www.sdsc.edu
- SDSC ABLE Program: https://education.sdsc.edu/studenttech/able
- SDSC StudentTECH program: http://education.sdsc.edu/studenttech
Ange Mason
Education Program Manager, San Diego Supercomputer Center, UCSD
UCSD Campus Education Community intern
Ange Mason, Education Program Manager, San Diego Supercomputer Center, UCSD
Do you love working with students in a fun and informal setting? Do you like taking photos and being creative with Zoom? Then this summer opportunity is for you! Help the San Diego Supercomputer Center (SDSC) highlight our summer camp program to share with next year’s Summer student applicants and the community.
Wouldn’t it be helpful to be able to view a short video or view a photo album about a popular summer program before you apply to the program? We think so, too! Help us with this project and have your work immortalized on our SDSC webpages for years to come!
The successful student candidate will meet regularly with his/her mentor during the week to brainstorm and strategize successful video projects and photo albums. Join us for a fun and informative summer!r to create a compelling piece of work. Join us and take the challenge!
Number of Students Requested: 1–2
Number of hours per week: 10–15 hours
Plan to Integrate Student into Group Activity:
The student will attend the group meetings and share in weekly planning and logistical meetings. The student will work closely with the project lead and the other Education team members.
Student Prerequisite:
We are looking for a student who is self-driven and able to work with minimal supervision. Student must be customer service oriented and like working with a diverse population of students.
Interested applicants must have strong video creation knowledge(filming + editing). For a hint as to the skills that we are looking for, please visit this link: https://www.youtube.com/results?search_query=SDSCTV.
Relevant Links:
- Past student work: https://education.sdsc.edu/studenttech/?page_id=570
- San Diego Supercomputer Center: http://www.sdsc.edu/
- San Diego Supercomputer Center StudentTECH program: https://education.sdsc.edu/studenttech
Ange Mason
Education Program Manager, San Diego Supercomputer Center, UCSD
SDSC Education Website Development intern
Ange Mason, Education Program Manager, San Diego Supercomputer Center, UCSD
The San Diego Supercomputer Center (SDSC) Education website has changed drastically over the last two years and now presents a sleek and modern design. Two items that are missing include and event calendar and a photo gallery. The current design was created by two former REHS students so please do not be nervous about applying for this opportunity.
The website was created using WordPress. WordPress skills are a MUST. If you are skilled in other website development software and feel that you can master WordPress without difficulty, please be prepared to share a single WordPress page during the interview process.
Students must possess:
Excellent written and oral communication skills
Previous experience working on a website or creating a website.
Demonstrated proficiency with WordPress and/or demonstrated ability
Number of Students Requested: 1–2
Number of hours per week: To be arranged (10-15)
Plan to Integrate Student into Group Activity:
The intern will attend the group meetings and share in weekly planning and logistical meetings. The student will work closely with the project lead and the other Education team members.
Student Prerequisite:
We are looking for a student who is self-driven and able to work with minimal supervision. The intern must be customer service oriented and able to work with a diverse work setting. Problem solving skills are a must. Creativity and innovative ideas are desired. The intern must possess excellent written and oral skills.
Relevant Links:
- San Diego Supercomputer Center (SDSC): http://www.sdsc.edu
- SDSC StudentTECH program: http://education.sdsc.edu/studenttech
Ange Mason
Education Program Manager, San Diego Supercomputer Center, UCSD
Research Projects (Hybrid)
Metabolic Profiles of Cancer Diagnostics
Dr. Valentina Kouznetsova, Project Scientist, San Diego Supercomputer Center, UCSD
Dr. Igor Tsigelny, Department of Neurosciences, San Diego Supercomputer Center, UCSD
Cancer is the second leading cause of death in the United States, and responsible for 580,350 deaths in 2013, it has become ever more imperative that researchers focus their attention in this field. Ingenuity Integrated Pathway Analysis (IPA) program is used to build and explore transcriptional networks, microRNA-mRNA target networks, phosphorylation cascades, and Protein-Protein or Protein-DNA interactions. We try to identify regulatory effects that lead from signaling events to transcriptional events for causal network analysis and to construct a comprehensive chart of diseases and functions associated with target genes so as to elucidate gene functionality and possible points of drug administration.
Number of Students Requested: 1
Number of hours per week: 15-20 hours.
Plan to Integrate Student into Group Activity:
The student will be a part of the research team working on the larger scale project that includes the project described here. He/she will attend the group meetings and communicate with the team members using other methods of communication. The student will work closely with the lead person and the other personnel involved.
Student Prerequisite:
We are looking for a student who is interested in both biology and computation.
Relevant Links:
- San Diego Supercomputer Center (SDSC): http://www.sdsc.edu
Dr. Valentina Kouznetsova
Project Scientist, San Diego Supercomputer Center, UCSD
Click here to learn more about Dr. Kouznetsova.
Dr. Igor Tsigelny
Department of Neurosciences, San Diego Supercomputer Center, UCSD
Writer for Science Gateways Stories
Claire Stirm, San Diego Supercomputer Center, UCSD
Are you interested in learning more about how researchers use websites and academic systems for their research? Are you interested in helping write about these researchers and highlighting them into a future published storybook? Then this summer opportunity is for you! Help the Science Gateways Center of Excellence (SGX3) write about science gateways and how they are solving research challenges and teaching the next generation of researchers.
Tasks will include conducting your own research about the science gateway, including looking at research papers and information on the science gateway’s public site; writing down findings in a digital document to be shared with your mentor; interviewing members of the SGX3 team to learn how SGX3 has supported this science gateway through consulting services; and writing a story about the science gateway. During the summer we will create several stories with the opportunity to prepare several stories for a future conference paper and to be included in our future storybook talking about outcomes of these science gateways.
The successful student candidate(s) will meet regularly with their mentor to review tasks and provide suggestions and feedback. The student candidate(s) will also have access to other communication channels to talk to other members of the SGX3 team via Slack and email.
Number of Students Requested: 1-2
Number of hours per week: 15-25 hours.
Names, organizations, and roles of others who may provide additional mentoring for the student:
- Sandra Gesing, SDSC, Community & Outreach Lead for SGX3
- David Montoya, TACC, Website Developer Lead for SGX3
A description of the plan to integrate the student researcher into the group’s activities:
The student will attend the meetings with their mentor and join some group meetings with members of the SGX3 team. The student will work with their mentor to define weekly goals and actions. At the end of each week, the student will share back with the mentor outcomes and problem-solve on any challenges.
List of student prerequisites for the research project:
We are looking for a student who has excellent communication skills, self-driven, and able to work with light supervision. The student must be customer service oriented and like working with a diverse population of academic professionals.
For a hint as to the skills that we are looking for, please see our previous storybook at https://sciencegateways.org/app/site/media/files/SGCI_storybook_interactive_v2022-12-13.pdf
If you have interest or these bonus skills – a bonus but not required:
- Graphic design
- HTML
- Knowledge or experience interacting with research applications
Relevant Links:
- Science Gateways Website: https://sciencegateways.org/
- Science Gateways Storybook: https://sciencegateways.org/app/site/media/files/SGCI_storybook_interactive_v2022-12-13.pdf
Claire Stirm: Project Manager, San Diego Supercomputer Center, UCSD
Developing Interactive Jupyter Notebooks to run on the ICICLE project
Mary Thomas, Ph.D., Expanse and TSCC Training lead, Computational Data Scientist in the Data-Enabled Scientific Computing Division
This project involves running Jupyter Notebooks on the NSF funded Expanse high-performance (HPC) system [1] and testing software to be used on the NSF funded ICICLE AI project [3]. Expanse is SDSC’s newest supercomputer. The result of a $10M National Science Foundation (NSF) award, Expanse delivers over 5,2 peak petaFlOps of computing power to scientists, engineers, and researchers all around the world [2]. Expanse provides three kinds of HPC/CI resources: General Computing Nodes, NVIDIA GPU Nodes, and the petascale Luster filesystem. Thousands of users have accessed these high-performance computing (HPC) resources via traditional runs from the command line and using batch queuing systems.
The National Science Foundation funded AI Institute for Intelligent Cyberinfrastructure with Computational Learning in the Environment (ICICLE) [3] will build the next generation of Cyberinfrastructure to render Artificial Intelligence (AI) more accessible to everyone and drive its further democratization in the larger society. ICICLE will develop intelligent cyberinfrastructure with transparent and high-performance execution on diverse and heterogeneous environments. It will advance plug-and-play AI that is easy to use by scientists across a wide range of domains, promoting the democratization of AI.
Scientists using HPC Systems working with interactive HPC tools such as Jupyter notebooks to implement computational and data analysis functions and workflows [5]. Jupyter notebooks are web applications that allow you to create and share documents that contain live code, equations, visualizations and narrative text. These notebooks part of a general trend in research computing away from command-line style interfaces and towards browser-based and graphical interfaces. Jupyter notebooks are especially useful for interactivity: the development, testing, and exploration of data sets or as an instructional resource [6]. Users working interactively expect a timely response, both for initial application startup and during the course of a session.
The goals of this research project are:
- To test and develop Jupyter Notebooks that run software components developed by the ICICLE project.
- To learn the basics of Jupyter notebooks, AI, and Knowledge Graphs.
The research components are:
- Contribute to the body of knowledge needed for hosting live, dynamic, interactive services that interface to HPC systems
- Learn how to develop interactive notebooks to be used for education and training of the users of these systems
Number of students to be supported: 2-4
Number of hours per week: 15-25 hours
Plan to integrate student into group activity
Prior to beginning the REHS program, the selected student team members will be provided with recommended programming exercises to help build the skills they will need to learn in order to successfully complete this project. Dr. Thomas and other mentors will be available via email to provide guidance to the students on how to approach these exercises. During the first week of the REHS program, the student team will then work closely with Dr. Thomas and other mentors to build a research plan that clearly defines the milestones of the project in order to meet its goals. In addition, the students will have the opportunity to interact with other REHS students and undergraduate or graduate interns that will be working on similar projects.
Student Prerequisites
Applicants must have a demonstrated interest in computer science and mathematics. In addition, some previous experience in: Jupyter Notebooks; some exposure to Artificial Intelligence (AI) methods; programming in Python (preferred); exposure to the Linux/Unix operating system.
IMPORTANT APPLICANT CRITERIA – Please DO NOT apply if you:
- Cannot participate via Zoom or be on campus during most of the committed hours per week;
- Have more than one week of family vacation planned during the internship period;
- Have accepted another internship, either inside or outside of SDSC;
- Have a FT job during the internship position;
- Are taking an SAT prep class or other such course where course hours could conflict with summer internship hours.
Relevant Links
- https://www.sdsc.edu/News%20Items/PR20190716_Expanse.html
- https://www.sdsc.edu/services/hpc/expanse/
- The Intelligent Cyberinfrastructure with Computational Learning in the Environment (ICICLE) Project: https://icicle.osu.edu/
- http://www.ic-foods.org/
- https://www.tacc.utexas.edu/home
- The Jupyter Notebook Project Website, https://jupyter.org/
- Zonca, A. and R.S. Sinkovits, Deploying Jupyter Notebooks at scale on XSEDE for Science Gateways and workshops. Available at: https://zonca.github.io/docs/pearc18_slides_zonca_sinkovits.pdf
Mary Thomas, Ph.D.
Expanse and TSCC Training lead, and Computational Data Scientist in the Data-Enabled Scientific Computing Division
Click here to learn more about Dr. Thomas.
Additional mentors
- Matthew Lange, Ph.D., CEO, International Center for Food Ontology Operability Data and Semantics (IC-FOODS [4])
- Joe Stubbs, Ph.D., Manager, Cloud & Interactive Computing (Texas Advanced Computing Center [5])
- Andreas Goetz, Ph.D, Director, Computational Chemistry Laboratory
- Robert Sinkovitz, Ph.D., Director of Education (SDSC)
Molecular Mechanisms of Diseases: cancer, autism, Alzheimer’s disease
Dr. Valentina Kouznetsova, Project Scientist, San Diego Supercomputer Center, UCSD
Dr. Igor Tsigelny, Dept. of Neurosciences, San Diego Supercomputer Center, UCSD
Autistic Spectrum Disorders (ASD):
ASD is a neurodevelopmental disorder that includes social impairments, restrictive and repetitive behavioral patterns, and communication difficulties. Scientists have not found the cause for autism, however ASD is associated with both genetic and environmental factors. Autism is predicted to be caused by genetic mutations in critical neurological areas of the genome due to its etiological heterogeneity and likelihood in siblings. Specific genes and genomic loci have been located and implicated in ASD development. We study these gene interactions with the transcription factors that activate them and to find the spots of possible drug administration, molecular modeling and molecular dynamics simulations.
Alzheimer’s:
Alzheimer’s Disease is a neurodegenerative disorder that affects a variety of the brain’s functions, starting with short term memory loss, and eventually progressing to mood swings, disorientation, depression, and even loss of bodily functions. It is responsible for 60-70% of all dementia cases, and is the sixth leading cause of death in America. It is characterized by a buildup of β-amyloid plaques in the brain. There is no known cure for the disorder today, but recently, the amyloid hypothesis has been heavily researched as the mechanism of the disease. The amyloid precursor protein (APP) is cut by two enzymes, β-secretase and γ-secretase, and β-amyloid, the toxic form of the protein, is one of the segments that remains. Since the marker of Alzheimer’s is the buildup of this plaque in the brain, there is potential for slowing the progression of the disease if production of this protein is inhibited. We study existing drugs inactions with the target proteins and try to find methods to improve their action.
Cancer:
Cancer is the second leading cause of death in the United States, and responsible for 580,350 deaths in 2013, it has become ever more imperative that researchers focus their attention in this field. Ingenuity Integrated Pathway Analysis (IPA) program is used to build and explore transcriptional networks, microRNA-mRNA target networks, phosphorylation cascades, and Protein-Protein or Protein-DNA interactions. We try to identify regulatory effects that lead from signaling events to transcriptional events for causal network analysis and to construct a comprehensive chart of diseases and functions associated with target genes so as to elucidate gene functionality and possible points of drug administration.
Number of Students Requested: 3
Number of hours per week: 15-20 hours.
Plan to Integrate Student into Group Activity:
The students will be a part of the research team working on the larger scale project that includes the project described here. They will attend the group meetings and communicate with the team members using other methods of communication. The student will work closely with the lead person and the other personnel involved.
Student Prerequisite:
We are looking for a student who is interested in both biology and computation.
Relevant Links:
- San Diego Supercomputer Center (SDSC): http://www.sdsc.edu
Dr. Valentina Kouznetsova
Project Scientist, San Diego Supercomputer Center, UCSD
Click here to learn more about Dr. Valentina Kouznetsova.
Dr. Igor Tsigelny
Department of Neurosciences, San Diego Supercomputer Center, UCSD
UCSD EarthCube Science Writer
Kimberly Mann Bruch, External Relations, San Diego Supercomputer Center, UCSD
The EarthCube Office strives to assist geoscientists and the wider research community through tools, methods, standards, architectures, and community connections. Our group seeks high school students that are interested in writing pieces related to EarthCube projects from start to finish. As a result of this internship, students will come away with an ability to reduce technical jargon and other barriers for the creation of easy-to-understand written pieces. Students will have the opportunity to choose a geoscience domain that is of interest to them and create both written and video pieces regarding that topic.
Number of Students Requested: 2
Number of hours per week: 15-20 hours
Plan to Integrate Student into Group Activity:
Kim Bruch will supervise students with additional mentoring provided by Ben Tolo and Owen Stanley of the SDSC External Relations Group. An initial meeting will be used to outline the basics of the internship, with weekly meetings to keep everyone on track. Completed pieces may appear on the EarthCube website, with full credit given to students for their work so they may include it in their portfolios. Students will also participate in a poster session at the end of the internship.
Student Prerequisite:
- B or better, in at least one science or engineering course at the high school level.
- B or better, in an English class at the high school level.
- Proficiency with Google applications (docs, sheets, drive, etc).
Relevant Links:
- San Diego Supercomputer Center: http://www.sdsc.edu
- EarthCube: https://www.earthcube.org/
Kimberly Mann Bruch
Science Writer, External Relations, San Diego Supercomputer Center, UC San Diego
Neuroscience Gateway – Computational Modeling of Neurons and Processing of EEG Data
Amit Majumdar, San Diego Supercomputer Center, UCSD
Kenneth Yomomoto, San Diego Supercomputer Center, UCSD
The Neuroscience Gateway (NSG – https://www.nsgportal.org) provides access to supercomputing resources for computational and cognitive neuroscientists. Through a simple web-based portal, the NSG provides a user-friendly environment for uploading models or data, specifying supercomputing job parameters, querying running job status, receiving job completion notices, and storing and retrieving output data. The NSG distributes user jobs to appropriate supercomputing resources. REHS students will learn about supercomputing and NSG during the initial period of the internship. Followed by that REHS students will be involved in either computational modeling of neurons using the NEURON software or processing of Electroencephalography (EEG) data using the EEGLAB software. In computational modeling scientists build models of neurons or network of neurons that perform various functions in our brain. NEURON is a widely used software for such modeling of neurons. Students will learn about a modeling project as part of their internship. EEG project involves recording of brain’s electrical activities tied to some function a subject is performing, and processing the data using software (such as the EEGLAB) and computing resources to see connection between brain signals and the function. Students will record and process EEG data as a part of their internship.
We are interested in students who have some background and interest in programming and scripting and are interested in neuroscience, computer science, data processing and modeling. Some experience with programming languages and exposure to Linux systems are preferred.
Number of Students Requested: 3-6
Number of hours per week: 10-15 hours.
Plan to Integrate Student into Group Activity:
The student will be a part of the research team working on the larger scale project that includes the project described here. He or she will attend the group meetings and communicate with the team members using other methods of communication. The student will work closely with the lead person and the other personnel involved.
Student Prerequisite:
Some knowledge of web technologies such as JavaScript, MySQL database, HTML, & XML.
Relevant Links:
- Neuroscience Gateway: http://www.nsgportal.org/
- EEGLAB: https://sccn.ucsd.edu/eeglab/index.php
- NEURON: https://neuron.yale.edu/neuron/
- Video of introduction to NSG, and modeling of neurons and processing of EEG data using NSG: https://www.youtube.com/watch?v=_qehTEwrE0s
Kenneth Yomomoto
San Diego Supercomputer Center, UCSD
Amit Majumdar
San Diego Supercomputer Center, UCSD
Click here to learn more about Mr. Majumdar.
Additional Mentors:
- Arnaud Delorme, Swartz Center for Computational Neuroscience, UCSD
- Dung Truong, Swartz Center for Computational Neuroscience, UCSD
- Ted Carnevale, Department of Neurobiology, Yale University
Tracking Data Center Space with Jupyter Notebooks and NoSQL
Rick Wagner, San Diego Supercomputer Center, UCSD
This project will use Jupyter Notebooks as the front end to a NoSQL database tracking the data center space used by current and future HPC systems at SDSC. Students working on this project will learn about data modeling, authentication and authorization using OAuth/OIDC, and generating various graphics in Jupyter Notebooks. Documentation will be a key part of this project, since it will be extended by SDSC staff in the future.
We are interested in students who have some background and interest in programming and scripting and are interested in neuroscience, computer science, data processing and modeling. Some experience with programming languages and exposure to Linux systems are preferred.
Number of Students Requested: 1-2
Number of hours per week: 15-25 hours.
Plan to Integrate Student into Group Activity:
Prior to beginning the REHS program, the selected students will be provided with recommended programming exercises to help build the skills they will need to learn in order to successfully complete this project. Dr. Wagner will be available via email to provide guidance to the students on how to approach these exercises. During the first week of the REHS program, the student team will be given a background on the goals of the project and the tools that will be used. Each week the students will meet with Dr. Wagner to discuss their progress, any challenges they’re facing, and to plan the next steps. The students will also meet with the SDSC staff that will use this tool.
Student Prerequisite:
Some previous experience in one or more of the following areas: Jupyter Notebooks; programming in Python (preferred); web development; version control using Git and GitHub; or databases.
Relevant Links:
- Relevant web links:
- • The Jupyter Notebook Project Website, https://jupyter.org/
- • Globus
- • Documentation
- • Globus Search
- • Globus Python SDK
Rick Wagner, Ph.D.
San Diego Supercomputer Center, UCSD
Chief Technology Officer
Click here to learn more about Dr. Wagner.
Learning the Skills of Communications and Science Writing for SDSC Publications
Kimberly Mann Bruch, Science Writer, San Diego Supercomputer Center, UC San Diego
Our group seeks high school students who are interested in journalism with a focus on learning more about effectively communicating the results and societal benefit of science, engineering, and technological research projects. Candidates should have a strong desire to learn more about how to communicate technological research — with a keen eye for making the work both engaging and easily comprehended by a general audience. At the same time, students should be mindful of not diminishing the technical work of the research team in any way. Sound challenging?
As a result of this internship, students will come away with an ability to reduce technical jargon and other barriers for the creation of easy-to-understand written pieces. Students will have the opportunity to choose a science, engineering, and/or technological domain that is of interest to them and create written pieces regarding that topic as related to supercomputing.
Students will be assisted with both the science writing process as well as the final editing and completion of their articles with accompanying images, captions, and credits.
Number of Students Requested: 4
Number of hours per week: 15-20 hours.
Plan to Integrate Student into Group Activity:
Kim Bruch of the External Relations Group will supervise students. An initial meeting will be used to outline the basics of the internship, with weekly meetings to keep everyone on track. Completed pieces may appear on the SDSC and/or additional applicable websites, with full credit given to students for their work so they may include it in their portfolios. Students will also participate in a poster session at the end of the internship
Relevant Links:
Kimberly Mann Bruch
Science Writer, External Relations, San Diego Supercomputer Center, UC San Diego
Machine Learning in Computational Chemistry
Dr. Andreas Goetz, San Diego Supercomputer Center, UCSD
In this project, we will explore how modern machine learning approaches can be employed in computational chemistry. As a background, the work in our lab encompasses computer simulations based on quantum chemistry and molecular dynamics. We develop numerical models and software for atomistic simulations of condensed phase systems and apply these to solve problems in the chemical sciences, ranging from atmospheric chemistry to biochemistry and drug design. Our focus has been on multiscale methods that couple computationally demanding quantum descriptions of the electronic structure with computationally simpler classical interaction potentials in so-called QM/MM methods. This enables numerical simulations of realistic models that are intractable with pure QM methods. The importance of such methods has been recognized with the award of the 2013 Nobel Prize in chemistry for “the development of multiscale models of complex chemical systems”. While machine learning approaches have a long history in computational chemistry, recent advances in data storage and compute capabilities have ushered in a new era of data-driven approaches, which we seek to explore and combine with existing QM/MM models. We are involved in the development of several software packages that are used by many research groups in academia and industry. This includes the quantum chemistry software packages ADF and QUICK and the molecular dynamics simulations package AMBER. Several other major codes that are deployed on major national computer resources are also an integral part of our work.
In this REHS project, the students will explore machine learning approaches that may improve the speed or enhance the accuracy of current state-of-the-art QM/MM simulation approaches. We will jointly work through online course material that introduces machine learning concepts from simple classifications to deep learning, culminating in applications that predict chemical properties. We will make use of Jupyter notebooks, the Python programming language, and widely used machine learning software frameworks. We will explore the performance of the software on CPU and GPU hardware including the Expanse supercomputer at SDSC. Time permitting, we will generate training data using quantum chemistry software and develop models that can predict new data without having to run the expensive QM simulations in the first place. The interns will thus have the possibility to acquire a wide range of skills in data science, computational chemistry, and high-performance computing. As an essential part of the internship the students will learn how to document their research, prepare research reports and present their results to their peers – skills that are very important for a successful research and engineering career.
Number of Students Requested: Up to 2 students, learning together and working on complementary aspects such as different data sets to train deep neural networks for distinct sets of chemical properties.
Number of hours per week: 20-25 hours
Plan to Integrate Student into Group Activity:
Dr. Andreas Goetz who is actively working on the model development, software implementation and simulation projects will closely supervise the students, work jointly with the students through the training material, and will meet with the students on a regular basis. In addition, the students will have the opportunity to interact with other high school students working in the lab and in the REHS program.
Student Prerequisite:
The candidates should have an interest in machine learning, software development, and chemistry. Prior machine learning expertise is not required. Exposure to scripting or compiled programming languages or familiarity with Linux and command line environments is expected. Specific knowledge of any of these areas is less critical than exceptional intellectual ability and good work ethic.
Relevant Links:
- Dr. Goetz’ website: www.awgoetz.de
- AMBER software: www.ambermd.org
- QUICK software: https://github.com/merzlab/QUICK
- ADF software: www.scm.com
- Deep Learning for Molecules and Materials course: https://dmol.pub
Dr. Andreas Goetz
San Diego Supercomputer Center, UCSD
Elucidation of Common Molecular Mechanisms of Parkinson’s Disease, Alzheimer’s Disease, and Diabetes
Dr. Valentina Kouznetsova, Project Scientist, San Diego Supercomputer Center, UCD
Dr. Igor Tsigelny, Dept. of Neurosciences, San Diego Supercomputer Center, UCSD
Aggregation of specific proteins is associated with a number of diseases including Parkinson’s (PD), Alzheimer’s (AD), and other neurodegenerative diseases. Type II diabetes (T2D) is also linked to aggregation of Islet amyloid polypeptides (IAPP) IAPP is found to be significantly accumulated as extracellular amyloid deposits in the pancreas of T2D patients; it is shown that early oligomers are significantly cytotoxic species that lead to beta-cell apoptosis. The mechanisms through which IAPP leads to T2D are unclear. Various lines of evidence appeared to support the contention that IAPP oligomers built in the early stages of aggregation form annular protofibrils and consequently pores in the membranes the same way as amyloid-beta and alpha-synuclein correspondingly create pores in the cell membrane in AD and PD. Increase of calcium influx to the cells can lead to apoptosis in case of all these diseases.
The specific tasks of the project are using computer modeling and simulations to accomplish the following goals: (1) To characterize the interactions between IAPP, amyloid beta, and alpha-synuclein molecules in oligomers, and interactions between molecules and the membranes. (2) To validate a concept of self-assembly of these molecules, using a set of mutants known to enhance or inhibit IAPP aggregation process with identifiable structural properties.
The task will include extensive analysis of research articles in the field, related to modeling and simulations above mentioned proteins and their interactions with the membrane. Computational molecular modeling and molecular dynamics simulations.
Number of Students Requested: 1
Number of hours per week: 15-20 hours
Plan to Integrate Student into Group Activity:
The student will be a part of the research team working on the larger scale project that includes the project described here. He or she will attend the group meetings and communicate with the team members using other methods of communication. The student will work closely with the lead person and the other personnel involved.
Student Prerequisite:
We are looking for a student who is interested in both biology and computation.
Relevant Links:
- San Diego Supercomputer Center: http://www.sdsc.edu
Dr. Valentina Kouznetsova
Project Scientist, San Diego Supercomputer Center, UCSD
Click here to learn more about Dr. Kouznetsova.
Dr. Igor Tsigelny
Department of Neurosciences, San Diego Supercomputer Center, UCSD
Open Science Chain
Subha Sivagnanam, Principal Investigator, San Diego Supercomputer Center, UCSD
The Open Science Chain (OSC – http://www.opensciencechain.org) project uses blockchain technologies to protect the integrity and provenance of research artifacts. The project is looking for summer interns who are interested in working with the OSC project. Interns will have the opportunity to engage in various tasks, including developing blockchain applications, revamping the web portal, or assisting in unit test development, depending on their interests and expertise levels. Programming experience (Go, Java etc) and familiarity with Linux and container technology are preferred. This is a completely virtual internship.
Number of Students Requested: 1-2
Number of hours per week: 10-15 hours.
Plan to Integrate Student into Group Activity:
Students will participate in weekly meetings with mentors virtually over Zoom.
Student Prerequisite:
Coding skills (min one): Python, C, Java, Nodejs, Go or similar; Familiarity with Linux.
Relevant Links:
Subha Sivagnanam
Principal Investigator, San Diego Supercomputer Center, UCSD
Click here to learn more about Ms. Sivagnanam.
Additional Mentors:
- Scott Sakai, San Diego Supercomputer Center, UCSD
- Fernando Garzon, San Diego Supercomputer Center, UCSD
- Kai Lin, San Diego Supercomputer Center, UCSD
Interpreting and Demystifying Neural Networks
Lily Weng, Assistant Professor, Halıcıoğlu Data Science Institute &Department of Computer Science and Engineering, UCSD
Deep neural networks (DNNs) have demonstrated great success in Machine Learning and Artificial Intelligence. However, the black-box nature of DNNs makes it hard to understand the underlying decision making process, which is crucial before DNNs can be deployed into real-world safety-critical systems. In this project, we will develop methods to understand DNNs in a human-understandable way.
Number of Students Requested: 2
Number of hours per week: 20-25 hours.
Plan to Integrate Student into Group Activity:
Student researchers will be involved in group meetings (presenting papers and progress) during the project.
Student Prerequisite:
-Good programming skills in Python and experience in deep learning libraries such as Pytorch
– Good mathematical skills in Linear Algebra and Calculus
– Knowledge and Experience in Machine Learning is preferred
Relevant Links:
- Reference paper: https://openreview.net/pdf?id=Hcx62bSwg9
Lily Weng
Assistant Professor, Halıcıoğlu Data Science Institute &Department of Computer Science and Engineering
Click here to learn more about Ms. Weng.
Additional Mentors:
- Scott Sakai, San Diego Supercomputer Center, UCSD
- Vishu Nandigam, San Diego Supercomputer Center, UCSD
- Kai Lin, San Diego Supercomputer Center, UCSD
Optimizing the Deployment of Scientific Software on High-Performance Computing Systems via the Spack Package Manager
Marty Kandes, Ph.D., Computational & Data Science Research Specialist, San Diego Supercomputer Center, SDSC
Expanse is one of SDSC’s newest supercomputers. The result of a $26.5M award from the National Science Foundation (NSF), Expanse delivers more than 5 petaflops of computing power to scientists, engineers, and researchers all around the world. In its first two years of operations, Expanse has served over 5,000 unique users across more than 1,300 research projects spanning a range of scientific disciplines, becoming one of the most widely used supercomputers in the NSF’s Advanced Cyberinfrastructure Coordination Ecosystem: Services & Support (ACCESS) program.
The High-Performance Computing (HPC) User Services Group at SDSC helps manage user support across all of its HPC systems like Expanse. This includes, but is not limited to, building and maintaining an up-to-date end user software environment of high-performance compilers, advanced communication and math libraries, and domain-specific scientific applications that are optimized for each system’s unique hardware capabilities. On Expanse, the User Services Group is developing a new software deployment process using Spack, an open-source software package management tool that is designed to simplify and accelerate the building, installation, and customization of large, complex scientific and high-performance computing software environments. While the current production software environments available on Expanse are managed by Spack, the deployment and maintenance of each environment remains a largely manual and less-than-efficient, somewhat error-prone process.
The aim of this research project is to optimize and improve the Spack-based software deployment process on SDSC’s HPC systems. During the course of this project, you will learn how to:
- install software with Spack,
- schedule and run software build jobs on SDSC’s supercomputers like Expanse via the Slurm workload manager, and
- analyze and present benchmarking results for these build jobs.
Number of Students Requested: 2
Number of hours per week: 15-20 hours.
Plan to Integrate Student into Group Activity:
Prior to the start of the REHS program, the selected student researchers will be provided with a set of recommended programming tutorials to help build the skills necessary to successfully complete the project by the end of the summer. Dr. Kandes will be available via email and/or video conference meetings to provide any additional guidance the students may need on how to approach the tutorial exercises during this time. During the first week of the REHS program, the student researchers will then work closely with Dr. Kandes to build a research plan that clearly defines the milestones of the project in order to meet its goals. Thereafter, the student researchers will be expected to largely work independently and with one another on the project, but meet regularly with Dr. Kandes each week to provide updates on their progress, ask questions, and discuss any technical issues they’ve encountered. During the REHS program, the student researchers will also likely interact with and work with other members of the SDSC HPC User Services Group on portions of their project and be invited to attend regular group meetings.
Student Prerequisite:
Applicants should have a demonstrated interest in computer science and mathematics, a basic understanding of data analysis and visualization techniques, and some previous programming experience.
Relevant Links:
- https://spack.readthedocs.io/en/latest/
- https://spack-tutorial.readthedocs.io/en/latest/_static/slides/spack-sc22-tutorial-slides.pdf
- https://github.com/spack/spack
- https://github.com/sdsc/spack
- https://e4s-project.github.io/
- https://www.sdsc.edu/services/hpc/expanse/
- https://sdsc.edu/services/hpc/tscc/index.html
- https://access-ci.org/
Marty Kandes
Ph.D., Computational & Data Science Research Specialist, San Diego Supercomputer Center, SDSC
Click here to learn more about Mr. Kandes.