Workshop on Teaching Computational Science (WTCS) Session 1

Time and Date: 10:35 - 12:15 on 6th June 2016

Room: Rousseau West

Chair: Alfredo Tirado-Ramos

219 Enhancing Computational Science Curriculum at Liberal Arts Institutions: A Case Study in the Context of Cybersecurity [abstract]
Abstract: Computational science curriculum developments and enhancements in liberal arts colleges can face unique challenges compared with larger institutions. We present a case study of computational science curriculum improvement at a medium sized liberal arts university in the context of cybersecurity. Three approaches, namely a cybersecurity minor, content infusion into existing courses, and a public forum are proposed to enrich the current computational science curriculum with cybersecurity contents.
Paul Cao, Iyad Ajwa
191 Teaching Data Science [abstract]
Abstract: We describe an introductory data science course, entitled Introduction to Data Science, offered at the University of Illinois at Urbana-Champaign. The course introduced general programming concepts by using the Python programming language with an emphasis on data preparation, processing, and presentation. The course had no prerequisites, and students were not expected to have any programming experience. This introductory course was designed to cover a wide range of topics, from the nature of data, to storage, to visualization, to probability and statistical analysis, to cloud and high performance computing, without becoming overly focused on any one subject. We conclude this article with a discussion of lessons learned and our plans to develop new data science courses.
Robert Brunner, Edward Kim
422 Little Susie: a PXE installation of openSUSE on a Little Fe [abstract]
Abstract: Little Fe is a six node Beowulf cluster made from mini-itx motherboards. It is designed to be a low-cost portable parallel computer for educational purposes. Bishop's Theoretical Molecular Biology Lab at Louisiana Tech has reconfigured a Little Fe to model the lab's openSUSE based network. Our Little Susie boots each of its diskless nodes with the same openSUSE operating system installed on the lab's workstations. All nodes utilize a common home directory that is physically attached only to the head node. Thus Little Susie allows students to practice using, maintaining and administering a computer network that has all of the features and tools of the lab's research resources but without compromising lab workstations. In theory, our Preboot Execution Environment (PXE) solution supports installation of any live linux distribution on the Little Fe creating a family of Littles: Little Susie, Little Debbie, Litte Hat, Little Mints. The advantage of this approach over Little Fe's Bootable Cluster CD (BCCD) operating system is that each node of Little Susie has a complete linux distribution installed on each node. Little Susie can thus function as six independent linux workstations or as a Beowulf parallel computer. This approach allows instructors to set up a computational science teaching lab “on the fly” as follows: The instructor setups up a PXE server—head node. Students PXE boot their laptops at the beginning of class to obtain identically configured workstations for the lesson of the day. After saving the day's work to the instructors hard drive students restore their laptop to its native state by simply rebooting. Instructions for setting up a Little Susie and a parallel molecular dynamics simulation with NAMD/VMD will be presented.
Tom Bishop and Anthony Agee
226 The Scientific Programming Integrated Degree Program - A Pioneering Approach to join Theory and Practice [abstract]
Abstract: While already established in other disciplines, integrated degree programs have become more popular in computer science and mathematical education in Germany as well over the last few years. These programs combine a theoretical education and a vocational training. The bachelor degree course "Scientific Programming", offered at FH Aachen University of Applied Sciences, is such an integrated degree program. It consists of 50% mathematics and 50% computer science. It incorporates the MATSE (MAthematical and Technical Software dEveloper) vocational training in cooperation with research facilities and IT companies located in and nearby Aachen, Jülich and Cologne. This paper presents the general concept behind integrated degree programs in Germany and the Scientific Programming educational program in particular. A key distinguishing feature of this concept is the continuous combination of theoretical education at university level with practical work experience at a company. In this fashion, students end up being very well positioned for the labor market, and companies educate knowledgeable staff familiar with their products and processes. Additionally students are able to earn two degrees in three years, which is a rare approach for computer science programs in Germany. Therefore, Scientific Programming offers an important contribution towards reducing the shortage in advanced software development and engineering on the German labor market.
Bastian Küppers, Thomas Dondorf, Benno Willemsen, Hans Joachim Pflug, Claudia Vonhasselt, Benedikt Magrean, Matthias S. Müller, Christian Bischof
234 Teaching computational modeling in the data science era [abstract]
Abstract: Integrating data and models is an important and still challenging goal in science. Computational modeling has been taught for decades and regularly revised, for example in the 2000s where it became more inclusive of data mining. As we are now in the `data science' era, we have the occasion (and often the incentive) to teach in an integrative manner computational modeling and data science. In this paper, we reviewed the content of courses and programs on computational modeling and/or data science. From this review and our teaching experience, we formed a set of design principles for an integrative course. We independently implemented these principles in two public research universities, in Canada and the US, for a course targeting graduate students and upper-division undergraduates. We discuss and contrast these implementations, and suggest ways in which the teaching of computational science can continue to be revised going forward.
Philippe Giabbanelli, Vijay Mago

Workshop on Teaching Computational Science (WTCS) Session 2

Time and Date: 14:30 - 16:10 on 6th June 2016

Room: Rousseau West

Chair: Angela Shiflet

209 Educational Module on a HPC Bioinformatics Algorithm [abstract]
Abstract: Prof. Angela Shiflet in computer science and mathematics and Prof. George Shiflet in biology are Fulbright Specialists. In January, 2015, they participated in a three-week collaborative project at University “Magna Græcia” of Catanzaro in Italy, in the Department of Medical and Surgical Sciences, hosted by Prof. Mario Cannataro. While there, the three along with Prof. Pietro Hiram Guzzi started a project to develop educational module(s) on one or more high-performance-computing bioinformatics algorithms. Drs. Cannataro and Guzzi have written a book, Data Management of Protein Interaction Networks (Wiley, 2011), and regularly teach bioinformatics and HPC. Upon returning to the United States, the Drs. Shiflet applied to have undergraduate Daniel Couch be a Blue Waters Intern for one year working on the project. The NSF-funded Blue Waters Project, which provides a stipend for the intern, supports “experiences involving the application of high-performance computing to problems in the sciences, engineering, or mathematics” (http://computationalscience.org/bwsip/). Besides having had an HPC course, the student participated in a two-week workshop at the National Center for Supercomputing Applications (NCSA) facilities on the University of Illinois Urbana-Champaign campus. In the project, he has written sequential and HPC programs and performed timings to accompany an educational module on “Aligning SequencesSequentially and Concurrently,” available at http://www.wofford.edu/ecs/, and is working with the professors on developing other modules and programs. After covering the necessary biological background, the named module develops the sequential Needleman-Wunsch Algorithm (NWA) to determine the similarity and the alignment(s) that yield a highest similarity score. Employing timings developed by the intern, the module illustrates that the algorithm’s runtime is proportional to the square of the number of nucleotides. Having motivated the need for HPC, the module discusses HPC pipeline versions of NWA along with timings. To aid students, the module contains fifteen Quick Review Questions, many with multiple parts; nine exercises; and five projects. Completed sequential and parallel C with MPI programs are available upon request by instructors. The materials are current being used by students and faculty members in a bioinformatics course at University “Magna Græcia” of Catanzaro.
Angela Shiflet, George Shiflet, Daniel Couch, Pietro Guzzi and Mario Cannataro
202 A Practical Parallel Programming Course based on Problems of the Spanish Parallel Programming Contest [abstract]
Abstract: This paper presents an experience of an introductory course on Parallel Programming. The course is dedicated to parallel programming tools and environments, and in particular to the analysis, development and optimization of parallel algorithms. It has a practical orientation and is guided with the use of problems from the Spanish Parallel Programming Contest. The different units are presented in the traditional lecture format, and a practical session accompanies each unit, with problems to work with in the tools or algorithmic paradigms presented in the previous lecture. The students work in the practical sessions on problems and using the system of the contest, which facilitates online and real time validation of their implementations. The practical approach of the course and the continuous evaluation used led to an important increase in the marks.
Domingo Gimenez
246 Using Principles from the Learning Sciences to Design a Data-Driven Introduction to Computational Modeling [abstract]
Abstract: In this talk we discuss designing, implementing, and researching an Introduction to Computational Modeling course for university undergraduates. The course is part of the brand-new Computational Mathematics, Science, and Engineering (CMSE) department at Michigan State University (MSU). It was specifically designed to be interdisciplinary; the course is open to any major at the university. The course was also built to address the growing need for a workforce that can analyze, model, and interpret real-world data. Our talk will cover three strands of developing the course: 1. Curriculum Design: how we worked backwards from the professional disciplinary practices of modeling to arrive at well-defined learning outcomes, assessments, and course content. 2. Instructional Environment: The key decisions we made and technologies we chose to bring the experience of modeling to the classroom. 3. Educational Research: How we’re using methods from the learning sciences—including clinical interviews and ethnographic classroom observation—to understand students’ experiences in the course and continually integrate findings into the course design
Brian Danielak, Brian O'Shea and Dirk Colbry
410 Modeling Knowledge Transfer... [abstract]
Abstract: Each scientific project or publication can be attributed to several fields of study with different degrees. Call interdisciplinary distribution the set of the degrees. If we have fixed number of fields of study, the set can be written in vector form. Each component of the vector corresponds to one of the fields of study. Call the vector the interdisciplinary vector. If we consider a scientist to be a set of his or her publications we can get the vector for a scientist as a weighted sum of a vector of his or her publications. This paper is devoted to an approach to the evaluation of the interdisciplinary distribution of professional or research objects (RO), and the transdisciplinary effects of their changes. RO and professionals can be evaluated on the basis of keywords in relevant scientific papers, reports, surveys, proposals, CVs, and so on. The transdisciplinary effect is apparent when the interdisciplinarity distribution has been changed. We propose formulas to evaluate this transdisciplinary effect. This approach was implemented using participants in group projects at the fourth Young Scientists Conference (YSC) on High-Performance Computing and Computer Simulation. The accuracy of the interdisciplinary vector of several participants was examined by the survey about their involvement in the team projects. This approach can be used to evaluate the compliance of a scientific team with the transdisciplinary research project (problem), as well as to assess the students' skills in transdisciplinary environments.
Nikita Kogtikov, Alexey Dukhanov, Klavdiya Bochenina