January/February 2018 issue of acmqueue

The January/February issue of acmqueue is out now

Semi-structured Data

  Download PDF version of this article PDF

ITEM not available


Originally published in Queue vol. 3, no. 8
see this item in the ACM Digital Library



Andrew McCallum - Information Extraction
Distilling structured data from unstructured text

Alon Halevy - Why Your Data Won't Mix
When independent parties develop database schemas for the same domain, they will almost always be quite different from each other. These differences are referred to as semantic heterogeneity, which also appears in the presence of multiple XML documents, Web services, and ontologies—or more broadly, whenever there is more than one way to structure a body of data. The presence of semi-structured data exacerbates semantic heterogeneity, because semi-structured schemas are much more flexible to start with. For multiple data systems to cooperate with each other, they must understand each other’s schemas.

Natalya Noy - Order from Chaos
There is probably little argument that the past decade has brought the “big bang” in the amount of online information available for processing by humans and machines. Two of the trends that it spurred (among many others) are: first, there has been a move to more flexible and fluid (semi-structured) models than the traditional centralized relational databases that stored most of the electronic data before; second, today there is simply too much information available to be processed by humans, and we really need help from machines.

C. M. Sperberg-McQueen - XML
XML, as defined by the World Wide Web Consortium in 1998, is a method of marking up a document or character stream to identify structural or other units within the data. XML makes several contributions to solving the problem of semi-structured data, the term database theorists use to denote data that exhibits any of the following characteristics:


(newest first)

mamo | Sat, 14 Nov 2009 13:02:31 UTC

I fully apretiate about the discusion you have. but one thing that is ,identification about the advantage and dis advantages of interview. thankyou

Leave this field empty

Post a Comment:

© 2018 ACM, Inc. All Rights Reserved.