2180713 WDM Syllabus
GUJARAT TECHNOLOGICAL UNIVERSITY
COMPUTER ENGINEERING
WEB DATA MANAGEMENT
SUBJECT CODE:2180713
B.E. 8th SEMESTER
Type of course: Bachelor of Engineering
Prerequisite: Nil
Rationale: This course provides an in-depth study of web data management, covering XML, XLink, and XPointer. It covers the state of the art in designing and building web applications and services, focusing on the issues and challenges of managing and processing data. It also introduces the concepts of ontologies, RDF, and OWL, and covers building web-scale applications using web search, web crawlers, web graph mining, MapReduce, etc.
Teaching and Examination Scheme:
L | T | P | Credits (C) | Theory: ESE (E) | Theory: PA (M), PA | Theory: PA (M), ALA | Practical: ESE (V), ESE | Practical: ESE (V), OEP | Practical: PA (I) | Total Marks |
3 | 0 | 2 | 5 | 70 | 20 | 10 | 20 | 10 | 20 | 150 |
Content:
Sr. No. | Content | Total Hrs | % Weightage |
1 | Data Model: Introduction to Modeling Web Data, Semistructured data, XML, Web Data Management with XML, XML Standards, XML and syntax, XML Data Model, XLink, and XPointer. | 05 | 10 |
2 | XPath and XQuery: Introduction, Basics of XPath and XQuery, XPath: Steps and path expressions, Evaluation of path expressions, Generalities on axes and node tests, Axes, Node tests and abbreviations, Predicates, XPath 2.0; FLWOR expressions in XQuery: Defining variables – the for and let clauses, Filtering – the where clause, The return clause, Advanced features of XQuery; XPath foundations. | 06 | 15 |
3 | Typing: Motivating Typing, Automata, Schema Languages for XML, Typing Graph Data: Graph Semistructured Data, Graph Bisimulation, Data guides. | 04 | 10 |
4 | XML Query Evaluation: XML fragmentation, XML identifiers: Region-based identifiers, Dewey-based identifiers, Structural identifiers and updates; XML evaluation techniques: Structural join, Optimizing structural join queries, Holistic twig joins. | 04 | 10 |
5 | Ontologies, RDF, and OWL: Introduction, Ontologies by example, Web resources, URI, namespaces, RDF, RDFS: RDF Schema, OWL, Ontologies and (Description) Logics. | 05 | 15 |
6 | Querying Data through Ontologies: Introduction, Querying RDF data: notation and semantics, Querying through RDFS ontologies, Answering queries through DL-LITE ontologies. | 04 | 10 |
7 | Data Integration: Introduction, Containment of conjunctive queries, Global-as-view mediation, Local-as-view mediation, Ontology-based mediators, Peer-to-Peer Data Management Systems. | 05 | 10 |
8 | Building Web scale applications: Web search, web crawlers, web information retrieval, Web graph mining and hot topics in web search, Distributed systems, failure management, Required properties of a distributed system, P2P networks, Hash-based structures, distributed indexing, Distributed computing with MapReduce. | 07 | 20 |
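As an illustration of the region-based identifiers listed under Unit 4, the following is a minimal Python sketch, not a prescribed implementation. It assumes the common (start, end, level) labelling, where one counter is incremented at each opening and closing tag. The function names (`label_regions`, `is_ancestor`, `is_parent`) and the sample document are made up for this example and are not part of the syllabus.

```python
import xml.etree.ElementTree as ET
from itertools import count


def label_regions(root):
    """Map each element to (start, end, level).

    Assumption: a single counter is advanced once when an element
    opens and once when it closes, so a descendant's interval is
    strictly nested inside its ancestor's interval.
    """
    counter = count(1)
    regions = {}

    def visit(node, level):
        start = next(counter)          # number taken at the opening tag
        for child in node:
            visit(child, level + 1)
        end = next(counter)            # number taken at the closing tag
        regions[node] = (start, end, level)

    visit(root, 0)
    return regions


def is_ancestor(a, d):
    """True if region a strictly contains region d."""
    return a[0] < d[0] and d[1] < a[1]


def is_parent(a, d):
    """True if a contains d and d is exactly one level deeper."""
    return is_ancestor(a, d) and d[2] == a[2] + 1


# Hypothetical sample document, used only for illustration.
doc = ET.fromstring("<a><b><c/></b><d/></a>")
regions = label_regions(doc)
```

With such labels, ancestor and parent tests reduce to integer comparisons, which is the property that structural joins build on.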
Suggested Specification table with Marks (Theory):
Distribution of Theory Marks |
R Level | U Level | A Level | N Level | E Level |
10 | 18 | 18 | 18 | 06 |
Legends: R = Remembrance; U = Understanding; A = Application; N = Analyze; E = Evaluation and above levels (Revised Bloom’s Taxonomy)
Reference Books:
- Serge Abiteboul, Ioana Manolescu, Philippe Rigaux, Marie-Christine Rousset and Pierre Senellart, “Web Data Management”, Cambridge University Press, 2011
- Bhavani Thuraisingham, “Web Data Management and Electronic Commerce”, CRC Press, 2000
- Bhavani Thuraisingham, “XML Databases and the Semantic Web”, CRC Press, 2002
- Athena Vakali and George Pallis, “Web Data Management Practices: Emerging Techniques and Technologies”, IGI Publishing, 2007, ISBN-10: 1599042282; ISBN-13: 978-1599042282
Course Outcome:
After learning the course, the student will be able:
- To understand the overall vision of the Semantic Web
- To analyze the current technology stack (URIs, XML, RDF/S, OWL)
- To understand how one could use these technologies for building something useful
- To define and test an ontology
- To define schema mappings
- To install and use tools for semantic data management
List of Experiments:
- Create an XML file defining an article in a newspaper.
- Create an XML file containing a list of students. Also create a stylesheet file to display the list in HTML format.
- Create an XML file containing a list of students. Using XPath, display the following information:
  - Information of the student with ID No. 101
  - All students, sorted by CGPA
- Create an XForm to collect information from staff members about their publications, with details such as Year of Publication, National/International, Title, Conference/Journal, etc.
- From the information gathered above, use XQuery to find the number of publications in a specific year.
- Demonstrate the use of AJAX.
- Study of the XMLSpy tool.
- Create an RSS feed for the events occurring in your institute.
- Write a program to read the articles in the RSS feed created in the above practical.
- Study of RDF (Resource Description Framework)
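As a starting point for the XPath experiment on the student list, the following is a minimal Python sketch using the standard library's `xml.etree.ElementTree`, which supports only a limited XPath subset. The element and attribute names (`students`, `student`, `id`, `name`, `cgpa`) and the sample records are assumptions made for illustration. Because XPath 1.0 has no sorting construct, the ordering by CGPA is done in Python here; in practice it could instead be done with XSLT or XQuery.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample data; the schema is an assumption, not a requirement.
STUDENTS_XML = """\
<students>
  <student id="101"><name>Asha</name><cgpa>8.4</cgpa></student>
  <student id="102"><name>Ravi</name><cgpa>9.1</cgpa></student>
  <student id="103"><name>Meera</name><cgpa>7.6</cgpa></student>
</students>
"""

root = ET.fromstring(STUDENTS_XML)


def to_record(node):
    """Convert a <student> element into a plain dictionary."""
    return {
        "id": node.get("id"),
        "name": node.findtext("name"),
        "cgpa": float(node.findtext("cgpa")),
    }


def student_by_id(root, student_id):
    """Select one student with an XPath attribute predicate."""
    node = root.find(f"./student[@id='{student_id}']")
    return to_record(node) if node is not None else None


def students_by_cgpa(root, descending=True):
    """Select all students via XPath, then sort them by CGPA in Python."""
    records = [to_record(n) for n in root.findall("./student")]
    return sorted(records, key=lambda r: r["cgpa"], reverse=descending)
```

Calling `student_by_id(root, "101")` returns the record for that student, and `students_by_cgpa(root)` returns all records with the highest CGPA first.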
Design based Problems (DP)/Open Ended Problem:
- Create a system that makes use of web search, web crawlers, and web information retrieval.
- Analyze and implement a system with web graph mining.
- Implement and subscribe to RSS news feeds to get the latest news in India.
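For the RSS experiments and the RSS problem above, the following is a minimal Python sketch that builds a small RSS 2.0 document and reads its items back, using only the standard library. The function names (`build_feed`, `read_feed`), the channel description, and the `example.edu` URLs are placeholders chosen for this example. Fetching a live feed over HTTP is deliberately left out.

```python
import xml.etree.ElementTree as ET


def build_feed(channel_title, channel_link, events):
    """Return an RSS 2.0 document (as a string) with one <item> per event.

    `events` is a list of (title, link) pairs.
    """
    rss = ET.Element("rss", version="2.0")
    channel = ET.SubElement(rss, "channel")
    # RSS 2.0 requires title, link, and description on the channel.
    ET.SubElement(channel, "title").text = channel_title
    ET.SubElement(channel, "link").text = channel_link
    ET.SubElement(channel, "description").text = "Events at the institute"
    for title, link in events:
        item = ET.SubElement(channel, "item")
        ET.SubElement(item, "title").text = title
        ET.SubElement(item, "link").text = link
    return ET.tostring(rss, encoding="unicode")


def read_feed(feed_xml):
    """Parse an RSS document and return its items as (title, link) pairs."""
    root = ET.fromstring(feed_xml)
    return [(i.findtext("title"), i.findtext("link")) for i in root.iter("item")]


# Placeholder events and URLs, for illustration only.
events = [
    ("Tech Fest", "https://example.edu/events/tech-fest"),
    ("Guest Lecture on RDF", "https://example.edu/events/rdf-lecture"),
]
feed = build_feed("Institute Events", "https://example.edu/events", events)
```

Passing `feed` to `read_feed` yields the same (title, link) pairs that were used to build it.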
Major Equipment:
XMLSpy, RSS Feed, RSS Reader.
List of Open Source Software/learning website:
ACTIVE LEARNING ASSIGNMENTS: Preparation of PowerPoint slides that include videos, animations, pictures, and graphics for a better understanding of the theory and practical work. The faculty will allocate chapters or parts of chapters to groups of students so that the entire syllabus is covered. The slides should be put up on the website of the College/Institute, with the names of the students in the group, the name of the faculty, the Department, and the College on the first slide. The best three works should be submitted to GTU.