An Applied Multidisciplinary Learning Journey in Data Science

Big Data Program Banner
For those studying (Information Technology, Health, Human Rights, Media, Banking and Finance, Business Administration, Law and Education). It is an intensive training program regarding the manufacturing of digital applications with our local partners. It is done within a digital science program provided by the Continuing Educational Unit in Birzeit University, in partnership with the National Development Research Center in Canada.   

Deadline for application submission was 16th of February 2021.

The Training Courses:

The program consists of two interrelated routes; one is for the Information Technology Major, and the other is for all other majors. It includes the following courses:  

Courses in the "Applied Multidisciplinary Learning Journey in Data Science" Program

Introduction to Data

During this Course you'll learn about Data Science, the goals and objectives of the Data Science Specialization and each of its components. You'll also get to know why data scientists are now in such demand, and the skills required to succeed in different jobs. Different business cases are presented during session implementation as case studies to view and discuss the data science economical potential.
 

Introduction to Python (for IT)

Python is a popular, easy to learn programming language. It is commonly used in the field of data analysis because there are very efficient libraries available to process large amounts of data. This so-called data analysis stack includes libraries such as NumPy, Pandas, and Matplotlib that we will familiarize ourselves with. In this course, an overview is given of the different phases of the data analysis pipeline using Python and its data analysis ecosystem. What is typically done in data analysis? We assume that data is already available, so we only need to download it. After downloading the data it needs to be cleaned to enable further analysis. In the cleaning phase the data is converted to some uniform and consistent format. After which the data can, for instance, be combined or divided into smaller chunks; grouped or sorted; condensed into small number of summary statistics; numerical or string operations can be performed on the data.
 

Introduction to Python (Non-IT)

Python is a popular, easy to learn programming language. It is commonly used in the field of data analysis because there are very efficient libraries available to process large amounts of data. This so-called data analysis stack includes libraries such as Pandas, and Matplotlib that we will familiarize ourselves with in this course.

Data Analysis ( for IT + Non-IT)

This module presents the assumptions, principles, and techniques necessary to gain insight into data via EDA--exploratory data analysis. we will be covering the essential exploratory techniques for summarizing data. These techniques are typically applied before formal modeling commences and can help inform the development of more complex statistical models. Exploratory techniques are also important for eliminating or sharpening potential hypotheses about the world that can be addressed by the data.
 

Data Management Plan (DMP)

We will provide learners with an introduction to research data management and sharing. After completing this course, learners will understand the diversity of data and their management needs across the research data life cycle, be able to identify the components of good data management plans, and be familiar with best practices for working with data including the organization, documentation, and storage and security of data. Learners will also understand the impetus and importance of archiving and sharing data as well as how to assess the trustworthiness of repositories.
 

Machine Learning

This Course will introduce Machine Learning concepts and practical information on how Machines learn to perform certain tasks as those used in the Artificial Intelligence world.

Big Data for Non-IT

In recent years, big data has become a popular buzzword when talking about growth in the technology sector. However, there are confusions about what it is exactly, especially for people who are not involved directly in technical departments. This can be due to lack of leadership sophistication around the concept and its associated possibilities. Big Data: A Beginner’s Guide for Non-Technical People is a guide to address that. We will go through the core concepts and ideas behind Big Data and with the goal of keeping it jargon-free and simple.
 

Big Data for IT

The main objective of this course is to familiarize the students with recent technological advancements in manipulating, storing, and analyzing big data. The emphasis of the course will be on practicing different components of Apache Spark, as the most important big data framework. Students will gain hands-on experience through multiple practices on Spark SQL, Spark ML (Machine Learning) API and Spark Streaming. In addition, topics in analyzing huge amount of textual content using Spark NLP and Elasticsearch technology will be covered as well.
 

 

Conditions of Enrollment for IT Graduates:

  • Good grades or practical experience in (Algorithms & Design)
  • Good grades or practical experience in (Data Structure)
  • Knowledge in software engineering 
  • Knowledge in (Web and Mobile Developments)
  • Prepared to work in groups of people from different educational fields, to execute practical projects

Conditions of Enrollment for other Majors:

  • Deep knowledge or practical experience in his/her major. 
  • Passionate about merging data science with their major or work. 
  • Prepared to work within groups of people from different educational fields, to execute practical projects. Those who are passionate about entrepreneur startup projects are preferred.
  • Able to read and write in English. 

Register