[ Home | Lab | New Students | Courses | Research | Publications | Activities ]

Back to Courses and Activities of Local Interest

Back to CSC 691D Home


CSC691D-003: Advanced Database Management

Call number: 281400
MW 4:45 - 6:00 PM Withers 328

Motivation

This course will address the advanced issues in modern database systems and applications. Databases underlie most complex computing systems. Major upcoming applications include scientific computing and enterprise integration. The present course will focus on the data-related issues in building, analyzing, and maintaining complex software systems. It will highlight the common concepts behind the different applications.

Increasingly, software systems that involve databases are heterogeneous. Traditionally, such systems are made to function in an unprincipled manner. This is because the simple approaches designed for small, centralized, homogeneous databases are ineffective and inappropriate for dealing with large, distributed, heterogeneous environments. Programmers often handcraft solutions, which distract them from their main objectives in scientific and business problem-solving and decision-support. However, the past few years have seen significant advances in techniques for operating and maintaining heterogeneous database systems. Consequently, expectations are rising in industry.

Highlights

This course will introduce several of the recent techniques that address complementary aspects of heterogeneity. It seeks to prepare students in the concepts that are important for leading-edge database management in industry, and for research into these problems. Highlights include agent programming, distributed objects, and workflow management using de jure or de facto standards such as Java, KQML, CORBA, and the WfMC (workflow management coalition) framework. KQML is an agent communication language; CORBA is a proposal of the Object Management Group (OMG), which includes about 500 software companies; WfMC is a subset of OMG. I am trying to obtain academic licenses for some upcoming products. The course will also examine and use software being developed by my research students.

The course has a prerequisite of CSC 542 or equivalent. Besides that requirement, it is totally self-contained. The grading scheme for the course will be flexible to accommodate both theoretically and practically inclined students, although everyone will learn the core theory and practice.

Topics

This is a list of topics that will be included in this course.

  1. Major application classes
    1. Scientific databases
    2. Enterprise integration
    3. Virtual enterprises
  2. Semantic interoperation
    1. Ontologies
    2. Extended transaction models
    3. Workflow management
  3. Information applications
  4. Agent techniques
    1. What is an agent?
    2. How to build an agent - infrastructure
    3. KQML and agent communication
    4. Search paradigms and coordination of agents
  5. Connectivity issues: CORBA
  6. Schematic interoperation
    1. Schema integration
    2. Formal reasoning about schemas
    3. Survey of tools
  7. Advanced Topics (only a few might actually be addressed)
    1. Unstructured or semistructured databases
    2. Scientific workflows
    3. Data warehousing
    4. Data and knowledge mining
    5. Information security
    6. Object-Oriented databases
    7. Multimedia databases

Plan

The course will have three phases (see associated schedule).
  1. I will teach from the course packet in the early part of the semester. Concurrently, we will define projects for teams of 3-4 students each.
  2. Students will give presentations and lead discussions on the papers listed below.
  3. I or various students will talk about CORBA. Students will implement some CORBA exercise, to be determined.

For presentations, students will form teams of 2 members. Each team will lead a class discussion on 2-4 related research papers. All students must read the assigned paper and will be graded on their participation. The discussion leaders will

  1. Describe its claims and approach (from the perspective of the authors)
  2. Evaluate it (from the perspective of the skeptical reader) with respect to
  3. Give their own insights into the subject
  4. Assign homework problems to the class (announce URL)
  5. Hand out solutions the following week

Remember that there is often a lot of detail in the papers, but you must summarize the essence and give your commentary in one class period.

Make about 20-30 transparencies for each class period (depends on how much is on each transparency). Bring copies for everyone.

Books

The papers to be covered from the books by Bukhres & Elmagarmid, and Kim are listed under readings. I don't expect to cover all of Siegel's book either. I believe that even the parts we don't cover from these books will come in handy to you in your database studies. Please exercise your discretion in buying these books.

  1. Required: Munindar P. Singh and Michael N. Huhns, CSC691D: Transparencies for Advanced Database Management (Cooperative Information Systems). Course Packet.
  2. Required: Selected Readings in Advanced Database Management. Course Packet.
  3. Required: Omran Bukhres and Ahmed K. Elmagarmid (eds.), Object-Oriented Multidatabase Systems: A Solution for Advanced Applications, Prentice Hall International, 1996. ISBN: 0-13-103813-2.
  4. Required: Won Kim (ed.), Modern Database Systems: The Object Model, Interoperability, and Beyond, Addison-Wesley, 1995. ISBN: 0-201-59098-0.
  5. Required: Jon Siegel, CORBA: Fundamentals and Programming, John Wiley, 1996. $49.95. ISBN: 0-471-12148-7.
  6. Optional: GMU's workflow module

Tentative Grading

Sitting in and audit requirements: must give a class presentation.
  1. Late midterm exam (late November): 20%
  2. Homework: 15%
  3. Class presentation and participation: 25%
  4. Project: 40%

Watch the page http://www4.ncsu.edu/eos/info/dblab/www/mpsingh/local/691d/ for updates.
singh@ncsu.edu