02.149.163001 Seminar: Bridge from Statistics to Data Science

Course offering details
Close 

Instructors: Michel Schilperoord

Event type: online: Seminar

Displayed in timetable as: S Fields Practice

Credits: 7,0

Language of instruction: Englisch

Min. | Max. participants: - | 30

Contents:
Title of the Course:

Bridge from Statistics to Data Science


Overview:

Computational Social Science (CSS), a young but growing research area at the intersection of social science, computational science and complexity science, refers to the use of (advanced) computational approaches in studying social phenomena. The main CSS areas are automated information extraction systems (e.g. automated text analysis), social network analysis, social geographic information systems (GIS), complexity modeling, and social simulation (e.g. agent-based simulation models). Skills of computational social scientists are built on foundations of statistical analyses done in Stata, SPSS, or another such program, through learning/exploring new skillsets uncommon in the social sciences that are developed by computer scientists and computational statisticians. These include network analysis, natural language processing and machine learning techniques, and the programming languages R and Python, all associated with doing "data science" in the "age of big data".

This seminar offers a course that functions as a bridge between statistical courses such as "Statistik Soziologie" and "Stata Soziologie" and introductory-level CSS (Computational Social Science). In the course, students will be introduced to applications of data science that can be relevant to their studies. It opens up questions with regard to opportunities and limitations that characterize the state-of-the-art for each computational method, with methodological focus in network analysis, natural language processing and machine learning. It will also assist in gaining hands-on experiences for programming (basic) applications of data science in R and Python, building on previously learned skills for doing (basic) statistics in Stata. Along hands-on class work, it will further assist in developing appreciation for common elements of data science workflows, for instance: data exploration and visualization, modelling and simulation, and communication of data science results.


Learning outcomes:

This course covers a survey of practical examples of how CSS researchers with a foundation in statistics/Stata can apply (basic) methods of Computational Social Science for achieving a better understanding of certain social and economic issues and problems. On the theoretical side, it will provide an overview of CSS and foundational knowledge on its common methods, their differences, with key literature systematically reviewed. On the practical side, it will provide guidance for reasoning about which types of data science methods may be suitable for application to certain social and economic issues and problems, and how to make choices with regard design and implementation of a data science project.


Course Requirements and assignments:

Assignment 1: Pre-class preparation

This assignment is about producing excerpts (1 page per text) from the text book.

Each student is asked to agree with the instructor on seven excerpts taken from the different sections of the textbook.

The excerpts will be uploaded on JOGU-StINe five days before the block seminar dates.

Assignment 2: Class presentation

This assignment is about presentation of a topic in class.

Each student is asked to discuss his/her choice with the instructor, and prepare a session from the syllabus. In certain cases, this can be done in cooperation.

Assignment 3: Data science project

This assignment is post-class work on a data science project concerning one topic of the syllabus (mostly the one chosen for presentation). The students need to present a short abstract and the contents structure for their project to the instructor 10 days after the block seminar dates. The deadline for essay submission is the end of Summer Term.


Textbook for the Course:

Cioffi-Revilla C. (2014) Introduction to Computational Social Science (Texts in Computer Science). London: Springer.

Appointments
Date From To Room Instructors
1 Mon, 18. Oct. 2021 14:15 15:45 Online Michel Schilperoord
2 Mon, 25. Oct. 2021 14:15 15:45 Online Michel Schilperoord
3 Mon, 8. Nov. 2021 14:15 15:45 Online Michel Schilperoord
4 Mon, 15. Nov. 2021 14:15 15:45 Online Michel Schilperoord
5 Mon, 22. Nov. 2021 14:15 15:45 Online Michel Schilperoord
6 Mon, 29. Nov. 2021 14:15 15:45 Online Michel Schilperoord
7 Mon, 6. Dec. 2021 14:15 15:45 Online Michel Schilperoord
8 Mon, 13. Dec. 2021 14:15 15:45 Online Michel Schilperoord
9 Mon, 3. Jan. 2022 14:15 15:45 Online Michel Schilperoord
10 Mon, 10. Jan. 2022 14:15 15:45 Online Michel Schilperoord
11 Mon, 17. Jan. 2022 14:15 15:45 Online Michel Schilperoord
12 Mon, 24. Jan. 2022 14:15 15:45 Online Michel Schilperoord
13 Mon, 31. Jan. 2022 14:15 15:45 Online Michel Schilperoord
Course specific exams
Description Date Instructors Mandatory
1. Term Paper Th, 31. Mar. 2022 23:58-23:59 Michel Schilperoord No
Class session overview
  • 1
  • 2
  • 3
  • 4
  • 5
  • 6
  • 7
  • 8
  • 9
  • 10
  • 11
  • 12
  • 13
Instructors
Michel Schilperoord