Transaction Processing over XML
Transaction Processing over XML (TPoX) is a benchmark for XML database systems. TPoX is an application-level benchmark that simulates a financial application scenario. The goal of TPoX is to allow database designers, developers and users to evaluate the performance of XML database systems. The focus areas include the XML query languages XQuery and SQL/XML, but also XML storage, XML indexing, XML Schema support, XML updates, logging, and concurrency control.
TPoX was originally developed and tested by IBM and Intel, and became an open-source project on sourceforge.net in January 2007. An updated version, TPoX 1.1, was released in June 2007. TPoX also proposes XML updates based on the XQuery Update Facility.
The TPoX benchmark exercises the processing of data-centric XML, as opposed to content- or document-centric XML.
The TPoX benchmark package contains the following:
- XML Schemas that define the XML data used in the benchmark.
- An XML data generation tool to generate an arbitrary number of XML documents with well-defined value distributions and referential integrity across documents.
- Workloads which are executed on the generated data. A workload is a set of transactions. A transaction can be a query in XQuery or SQL/XML notation or an insert, update or delete operation.
- A Java application which acts as a workload driver. It is configurable and can spawn 1 to n parallel threads to simulate concurrent database users. Each user connects to the database and executes a random sequence of transactions defined in the workload. Parameter markers in the transactions are replaced by real values that are drawn from random value distributions. The workload driver collects and reports performance metrics, such as the transaction throughput and the minimum, maximum and average response times.
- Documentation.
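The behaviour of such a workload driver can be illustrated with a minimal Java sketch. It is not the TPoX driver itself: the class name, the `?` parameter-marker syntax, the uniform value range, the transaction templates and the no-op `execute` method are all assumptions made for illustration. A real driver would send each statement to a database (for example over JDBC) instead of calling a stub.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.Random;
import java.util.concurrent.Callable;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;

class WorkloadDriverSketch {

    /** Response-time metrics for one simulated user, or merged for the whole run. */
    static final class Stats {
        long count = 0;
        long totalNanos = 0;
        long minNanos = Long.MAX_VALUE;
        long maxNanos = 0;

        void record(long nanos) {
            count++;
            totalNanos += nanos;
            minNanos = Math.min(minNanos, nanos);
            maxNanos = Math.max(maxNanos, nanos);
        }

        void merge(Stats other) {
            if (other.count == 0) return;
            count += other.count;
            totalNanos += other.totalNanos;
            minNanos = Math.min(minNanos, other.minNanos);
            maxNanos = Math.max(maxNanos, other.maxNanos);
        }

        double avgMillis() {
            return count == 0 ? 0.0 : totalNanos / 1e6 / count;
        }
    }

    /** Replaces every '?' marker with a random value (assumed uniform in 1000..9999). */
    static String bindParameters(String template, Random rng) {
        StringBuilder sb = new StringBuilder();
        for (char c : template.toCharArray()) {
            if (c == '?') sb.append(1000 + rng.nextInt(9000));
            else sb.append(c);
        }
        return sb.toString();
    }

    /** Stub standing in for a database call; it deliberately does nothing. */
    static void execute(String statement) {
    }

    /** One simulated user: runs a random sequence of transactions from the workload. */
    static Stats runUser(List<String> workload, int txCount, long seed) {
        Random rng = new Random(seed);
        Stats stats = new Stats();
        for (int i = 0; i < txCount; i++) {
            String template = workload.get(rng.nextInt(workload.size()));
            String statement = bindParameters(template, rng);
            long start = System.nanoTime();
            execute(statement);
            stats.record(System.nanoTime() - start);
        }
        return stats;
    }

    /** Spawns one thread per simulated user and merges the per-user metrics. */
    static Stats run(List<String> workload, int users, int txPerUser) throws Exception {
        ExecutorService pool = Executors.newFixedThreadPool(users);
        try {
            List<Future<Stats>> futures = new ArrayList<>();
            for (int u = 0; u < users; u++) {
                final long seed = u;
                Callable<Stats> task = () -> runUser(workload, txPerUser, seed);
                futures.add(pool.submit(task));
            }
            Stats total = new Stats();
            for (Future<Stats> f : futures) total.merge(f.get());
            return total;
        } finally {
            pool.shutdown();
        }
    }

    public static void main(String[] args) throws Exception {
        // Hypothetical transaction templates; not taken from the TPoX workload files.
        List<String> workload = Arrays.asList(
            "for $o in collection('orders')/Order[@ID = ?] return $o",
            "delete from orders where id = ?");
        long start = System.nanoTime();
        Stats s = run(workload, 4, 1000);
        double seconds = (System.nanoTime() - start) / 1e9;
        System.out.printf("transactions=%d throughput=%.0f tx/s%n", s.count, s.count / seconds);
        System.out.printf("min=%.4f ms max=%.4f ms avg=%.4f ms%n",
            s.minNanos / 1e6, s.maxNanos / 1e6, s.avgMillis());
    }
}
```

Each simulated user gets its own `Random` instance and its own `Stats` object, so the threads share no mutable state and the metrics are merged only after all users have finished.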
References
- Ron Bourret's list of XML database benchmarks
- New XML benchmark based on ToXgene
- A stub on the German Wikipedia
- An XML transaction processing benchmark, Proceedings of the 2007 ACM SIGMOD International Conference on Management of Data
- DB2 9 XML performance characteristics
- TPoX is referenced as [16] in a research paper from the University of Kaiserslautern, Germany