ECTS credits ECTS credits: 3
ECTS Hours Rules/Memories Hours of tutorials: 1 Expository Class: 10 Interactive Classroom: 11 Total: 22
Use languages English (100%)
Type: Ordinary subject Master’s Degree RD 1393/2007 - 822/2021
Departments: Electronics and Computing, External department linked to the degrees
Areas: Languages and Computer Systems, Área externa M.U en Intelixencia Artificial
Center Higher Technical Engineering School
Call: First Semester
Teaching: With teaching
Enrolment: Enrollable | 1st year (Yes)
The objective of the course is the introduction of the basic aspects of data engineering, fundamentally in the field of Big Data. The skills acquired will allow the efficient analysis and management of heterogeneous information, both structured and unstructured, within the development of AI applications, where traditional methods show their insufficiency.
- Concepts and foundations of data engineering: Basic concepts and definitions, efficient data loading problems in Big Data scenarios, massive data storage and access.
- Data cleaning and preparation techniques: Most common techniques, definition of streaming flows, data quality metrics.
- Advanced data structures and efficient data warehouses for Big Data: Multidimensional Data Warehouses and Databases, Data Lakes, NoSQL Databases.
Basic Bibliography:
Sadalage, Fowler. NoSQL Distilled: A Brief Guide to the Emerging World of Polyglot Persistence, Addison-Wesley, 2012.
Avi Silberschatz, Henry F. Korth, S. Sudarshan, Database System Concepts, Sixth edition, McGraw-Hill, 2010. ISBN 0-07-352332-1
Ihab F. Ilyas and Xu Chu. 2019. Data Cleaning. Association for Computing Machinery, New York, NY, USA.
Alex Gorelik, The Enterprise Big Data Lake: Delivering the Promise of Big Data and Data Science, O’ Reilly Media, Inc., 2019. ISBN: 9781491931554
Complementary Bibliography:
Matt Casters, Roland Bouman, Jos van Dongen,, Pentaho Kettle Solutions: Building Open Source ETL Solutions with Pentaho Data Integration, 978-0470635179, Wiley, 2013.
The Degree competences considered in this course (see the Degree definition document) are the following:
Basic and general skills: CG2, CG3, CG4, CG5, CB6, CB7, CB8.
Transversal competences: CT3, CT7, CT8, CT9.
Specific skills: CE16.
More specifically, the student will be able to:
- Acquire the skill of analysis and data modeling for data processing in intelligent systems.
- Acquire knowledge and understanding related to the data extraction, cleaning, transformation, loading and processing.
- Acquire knowledge related to the use of multidimensional and NoSQL databases.
- Acquire knowledge related to the fundamentals of data lakes and data warehouses.
The methodology of this course will be based on the combination of three types of face-to-face activities with autonomous work of the students.
Project-based learning: Practical projects are given to the students whose scope requires that an important part of the total dedication of the student is dedicated to the course. In addition, due to the scope of the work to be carried out, it is required not only that the students apply management skills but also technical skills.
Master class: The instructor presents a topic to the students with the aim to provide specific knowledge. This teaching methodology will be applied to the training activity of classes of theory.
Laboratory work: The teaching staff of the course poses to the students one or various problems of practical nature whose resolution requires the understanding and application of the theoretical-practical knowledge included in the contents of the course.
The students may work on the proposed problems individually or in groups. This teaching methodology will be applied to the training activity "Practical laboratory classes" and may be applied to the training activity of "Problem-based learning sessions, seminars, case studies and projects".
Autonomous problem solving: The teaching staff proposes to the students a problem whose scope and objectives require from them autonomous work. The process of design and implementation of the solution to the problem will be supervised by the teaching staff. In general, this methodology is applied to pieces of work with a time scope and greater effort than those of laboratory work.
Conditions applied to the two opportunities in June and July.
Project-based learning (70%): The solution provided by the student will be defended before the teacher or an oral presentation of the solution developed will be carried out. For the evaluation of the project, the autonomous work of the members of the group (40%) and the defense of the work in public session (30%) will be taken into account.
Laboratory practices (30%): Several deliveries related to the different parts of the work carried out in the practical sessions will be evaluated. The evaluation of these practices will be individual.
Those students who deliver the project are considered presented.
To pass the subject in any call, the final grade must be equal to or greater than 5, obtaining at least a 5 (out of 10) in each of the parts.
In the extraordinary calls, a specific delivery of an individual project will be designed in which the entire subject will be evaluated.
12 face-to-face hours of theory classes. 12 face-to-face hours of laboratory classes and project-based learning. 50 hours of autonomous and personal work of the students
Follow the proposed methodology, attending classes, devoting the necessary time to study and carrying out assignments and solving specific problems with the help of teachers in tutorial sessions
The virtual campus will be used to improve communication between students and teachers, to host the necessary material and to support the evaluation processes
Jose Ramon Rios Viqueira
Coordinador/a- Department
- Electronics and Computing
- Area
- Languages and Computer Systems
- Phone
- 881816463
- jrr.viqueira [at] usc.es
- Category
- Professor: University Lecturer
| Monday | ||
|---|---|---|
| 15:30-17:00 | Grupo /CLE_01 | IA.02 |
| Wednesday | ||
| 15:30-17:00 | Grupo /CLIL_01 | IA.02 |
| 12.22.2023 16:00-20:45 | Grupo /CLE_01 | IA.02 |
| 12.22.2023 16:00-20:45 | Grupo /CLIL_01 | IA.02 |
| 06.18.2024 16:00-20:45 | Grupo /CLE_01 | IA.02 |
| 06.18.2024 16:00-20:45 | Grupo /CLIL_01 | IA.02 |
| Teacher | Language |
|---|---|
| RIOS VIQUEIRA, JOSE RAMON | English |
| Teacher | Language |
|---|---|
| RIOS VIQUEIRA, JOSE RAMON | English |
| Teacher | Language |
|---|---|
| RIOS VIQUEIRA, JOSE RAMON | English |