All undergraduate students*, graduate students, and postdocs are welcome to register for this workshop. CSoI member students will be given priority registration. No training in Computer Science or expertise in any particular area is needed. Students from all data-driven disciplines are welcome.
The intended audience for the workshop are students at all levels who have not (yet) delved into a data science experience, but want to begin working in this area. As an outcome of this experience, students will be acquainted with several tools for data analysis.
Central to the diversity mission of the Center encourages women and other underrepresented groups including U.S. citizens, nationals, and permanent residents to apply. The spirit of this workshop is to bring together students and postdocs from multiple fields to lower barriers for understanding the language and approaches across multiple disciplines and data science.
* undergraduate students must be U.S. citizens or permanent residents. Registration for the workshop does not guarantee acceptance.
Dr. Mark D. Ward will introduce participants to some of the first principles and concepts from data analysis. Students who complete the workshop will learn several technologies, including skills for data wrangling and data visualization. We will utilize the R platform for data analysis, and will discuss strategies for reproducible research. As time permits, we will discuss how to use R to interact with SQL databases, how to scrape and parse XML code, and how to utilize techniques of data visualization, and the use of LaTeX. See the Pre-workshop section below to access the online short course, Introduction to R for Data Science in preparation for the R and data analysis sessions held during the workshop.
All of the topics will be offered in team-oriented projects. We will choose several sources of data, so that the workshop will have an interdisciplinary appeal. This enables students to develop a sense of community and confidence about introductory topics in data analysis.
"Through this workshop, I learned where to access all the big data that I am interested in analyzing. Having the opportunity to work on real data using a variety of tools in R definitely increases my confidence in entering the field of data science. This workshop also helped me with my professional development. I learned how to make clear and approachable presentations from many of the presentations at the workshop. Another important aspect that I got from this workshop is how much we can achieve through teamwork. We were not sure what we could extract from our data on the first day, but as we played with the data more and discussed more as a team, the data morphed into a very interesting project."
“This workshop provided an environment to let me deeply communicate with researchers from other fields. Within our team, we were able to explain our research and exchange thoughts without using jargon, which is a very important skill in our careers. On top of that, our team was able to generate an algorithm that progressed our research.”
“I presented an idea about bringing informatics to a [clinical medical issue] which was only nascent. Interactions with peers and faculty mentors who found this idea interesting in this workshop has made the proposal much more concrete and grounded in reality. This experience has proved to be invaluable with my vision and in formulating specific technical problems that will benefit from being solved.”
“I experienced how a solution to a problem can be approached from different perspectives. Brainstorming sessions, exploring new techniques, continuously looking forward for an efficient algorithm with compatibility really helps a person to be a valuable team member for any project in the future.”
“Being a fresh BSc graduate and having the opportunity to work with a post-doc and a Phd student provided me relevant exposure to a research environment. I developed new ways of thinking about problem solving by working with them. Furthermore, the interdisciplinary nature of the workshop encouraged a high level of interaction. I have come to realize the importance and power of expression of simple yet elegant visualizations to represent complex research is simply phenomenal. I was surprised and thrilled to have accomplished so much in such a short period of time.”
Lead Instructor: Dr Mark D. Ward, Associate Professor and Associate Director of Center for Science of Information, Purdue University
TA: Yucong Zhang, PhD candidate, Department of Statistics, Purdue University
Workshop Chair: Brent T. Ladd, Director of Education, Center for Science of Information, Computer Science, Purdue University
Co-chair: Kelly Andronicos, Director of Diversity, Center for Science of Information, Computer Science, Purdue University
Agenda for the Week (tentative)
Prior to the workshop, attendees will complete a brief four-week online course - free of charge - introducing them to the R environment and working with data sets during the weeks before the workshop starts [approx. 4 hrs per week commitment] This short course is presented online through tutorials recorded by Dr. Mark D. Ward, and results in all students starting the workshop with a comparable level of experience using R.
Complete the Introduction to R for Data Science - These tutorials and exercises are available through the CSoI Learning Hub. An email listserv will be available soon for use during the course starting April 25.
Each attendee is a member of a team based partly on interests and background, and partly on available team spots. We do our best to form teams with balance among postdoc, graduate students, and undergraduate students. Each team has one member that submitted a potential project with data that the team can work on. Read the one-page overview Best Practices for Successful Formation of Interdisciplinary Science Teams
Research Teams Funding Opportunity:
Following the workshop teams will have an opportunity to submit funding proposals to the Center to continue their research collaborations. You can view currently active and past student-postdoc research teams that received funding from the Center - Research Teams.
Lodging will be provided and paid for by the Center for all non-Purdue students. Students from outside Purdue requiring lodging will be staying at First Street Towers.
Off-campus students should plan to arrive at Purdue by Sunday evening, May 20th. The workshop will conclude Noon on Friday, May 25th.
Lunch will be catered onsite Monday through Thursday. Additionally, all attendees will be issued a dining card for use in the Wiley dining hall for breakfast and dinner Monday, May 21st - Thursday, May 24th, and all meals will be at the Wiley dining hall for Friday, May 25th. Wiley is directly across from the Purdue Co-Rec Gymnasium. For any additional food or drinks that you want to pay for with your own $ can be easily purchased from the Lawson Computer Science "Port" Cafe between the hours of 8am-2pm (this cafe is on the first floor of the building where our workshop is held). Note, for Sunday arrivals and Saturday departures we do not provide any meals and dining halls are closed, however, there are several options available by walking two blocks west of First Street Towers in the Purdue West shopping plaza, or walking about 1 mile east along State Street to the Chauncey Hill Mall area.
Note that you will want the Purdue West/Follett's location for shuttle drop-off/pick-up to and from the airport. This is along McCutcheon Drive by Follett's bookstore, and is directly two blocks west from the First Street Towers dorm .
Monday morning when you arrive at Lawson 3102 for the workshop, you will have with your name badge 1)A login for the Scholar server we will be using to access software and data, and for non-Purdue attendees 2) A login for the Purdue wireless network (PAL 3.0) and instructions for connecting.
Students and Postdocs from outside Purdue are eligible to apply for reimbursement of airfare and shuttle expenses to attend the full week workshop. Use the link above, and follow the instructions and forms provided. If there are any questions regarding travel reimbursement contact Robynne McCormick firstname.lastname@example.org.
This time in May is the quietest on the Purdue campus. Not a lot is happening this week - other than our workshop! Most evenings your team is encouraged to spend time together. In addition to eating dinner and meeting as a team during evenings, you might be interested in checking out the Purdue CoRec - one of the nicest facilities of its kind in the country. Check the summer hours before heading there. It is a short walk from your dormitory. Your Dining Card will give you access to the Co-Rec.
There are a number of unique shops, bars, and dining at the east end of campus, about a 15-minute walk directly east from First Street Towers. You can also find out what else is happening in the area during the evenings at HomeofPurdue.com. Although Purdue is a very safe campus, we strongly recommend walking at night as a group, and never alone, especially in areas off of campus.