翻译一段计算机英语（不要机器翻译和在线翻译！！！！）

显示全部楼层 · 2008-6-1 00:57:06

Electronic documents are often legal, historic, or business
transaction records, and queries against such documents typically
involve entities and relationships that represent features of the text
itself as well as features of the businesses involved in the
contractual agreements. For an XML database one fundamental
semantic issue is document equivalence [40]: when are two
documents or document parts or document DTDs the same? For
example, before inserting a document into the database, we might
want to find out if the same document is already in the database.
The question of equivalence is important in satisfying
requirements for evidence and archiving, for version management,
for metadata management, and (as is true of all forms of data) for
query optimization.
The XML 1.0 specification does not define equality of documents
or equality of entities, nor do the Infoset, XPath, or DOM models.
The XQuery 1.0 and XPath 2.0 Data Model includes one equality
operator to test node identity and another to test equality of
values. However semantics for the equality of node values does
not encompass all data from XML documents. W3C has proposed
that Canonical XML [10] be used to compare the equivalence of
two documents. The canonical form is created by a process called
canonicalization either from an XPath node set or an octet stream
containing a well-formed XML document. In both cases
canonicalization omits some of the information in the original
XML document. Since such a canonical form does not contain all
information from an XML document, this definition of
equivalence may not satisfy all applications’ needs. One solution
is to define document equivalence in terms of a model that
includes all document features, after which application-dependent
definitions of equivalence can be specified by applying document
equivalence to application-specific transformations of the
documents to be compared.

千问 · 2008-6-1 00:57:06

电子文件往往是法律，历史，或业务交易纪录，并查询这些文件通常涉及实体和关系的代表特征的文本本身以及特点，企业参与，在合同协议。一个XML数据库的一个根本语义问题是文件的等价[ 40 ] ：当两个文件或文件的部分或文件dtds相同呢？为了举例来说，前插入一个文件到数据库中，我们可能想要找出如果同一文件，是已经在数据库中。问题的等价性是很重要的，在满足要求的证据和归档，版本管理，为元数据管理，及（如真实的一切形式的数据）查询优化。该XML 1.0规范没有界定平等的文件或平等的实体，也没有Infoset的，的XPath ，或DOM的模型。有关的XQuery 1.0和XPath 2.0数据模