Trafodion: 针对HBase的SQL事务支持
2014-10-31 13:49
232 查看
Introduction
Trafodion is an open source initiative from HP, incubated at HP Labs
and HP-IT, to develop an enterprise-class SQL-on-HBase solution
targeted for big data transactional or operational workloads. HP has
developed transactional SQL technologies with more than 20 years of
investment into database technology and solutions. Trafodion brings
this core technology to the Hadoop ecosystem. The name 'Trafodion'
(the Welsh word for transactions, pronounced ‘Tra-vod-eee-on':) was
chosen specifically to emphasize the differentiation that Trafodion
provides in closing a critical gap in the Hadoop ecosystem. To find out
more about the origin and the name of the project, please visit
www.hp.com/go/trafodion.
Target workloads
Hadoop workloads span from long-running batch mode to low-latency
operational workloads as shown in the figure below. The three
categories on the right side are analytic workloads and are regarded as
well-suited for Hadoop and therefore have garnered the most attention.
In contrast, the leftmost workload defined as “Operational” is a new class
of workloads that encompasses OLTP workloads as well as transactions
that include social and mobile data interactions and observations using
a mixture of structured and semi-structured data.
![](http://img.blog.csdn.net/20141031134824182?watermark/2/text/aHR0cDovL2Jsb2cuY3Nkbi5uZXQvaHVhbmdnYW5nMDI4/font/5a6L5L2T/fontsize/400/fill/I0JBQkFCMA==/dissolve/70/gravity/Center)
Traditionally, these workloads have been handled by relational
databases. But, relational databases have scalability issues and do not
provide schema flexibility required in certain cases. Hadoop addresses
these limitations. Combined with Hadoop’s perceived benefits of
significantly reduced costs, there is growing interest and pressure to
embrace these workloads in the Hadoop ecosystem.
As operational workloads represent business needs, they typically consist
of a constant flow of transactions requiring low- latency response times
for read/write access. Additionally, these workloads are characterized by:
• Data integrity with ACID-compliant protection
• High availability, concurrency and scalability
• Multi-structured data
• Rapidly evolving data requirements
Features
Currently, there is no existing open source SQL-on-HBase solution
that adequately meets these requirements. Trafodion provides the
following functionality to support transactional workloads in
Hadoop:
• ACID-compliant distributed transaction protection
over multiple SQL statements, tables and rows
• Rich, full-functioned ANSI SQL language support using
ODBC/JDBC connectivity interfaces
• Performance improvements for transactional
workloads by leveraging compile-time and run-time
optimizations
• Support for large data sets using parallel-aware
query optimizer
Trafodion intends to leverage the full capabilities of Hadoop
ecosystem:
• Schema flexibility provided by HBase column family
structures
• Snapshot capability with versioning support in Hadoop
• High Availability and Disaster Recovery support with
replication and snapshotting capabilities
Benefits
Trafodion delivers a full-featured and optimized
transactional SQL-on-HBase DBMS solution with full
transactional data protection. These capabilities help
overcome Hadoop’s weaknesses in terms of supporting
transactional workloads.
![](http://img.blog.csdn.net/20141031134847785?watermark/2/text/aHR0cDovL2Jsb2cuY3Nkbi5uZXQvaHVhbmdnYW5nMDI4/font/5a6L5L2T/fontsize/400/fill/I0JBQkFCMA==/dissolve/70/gravity/Center)
With Trafodion, customers gain the following benefits:
• Ability to leverage SQL expertise versus complex MapReduce
programming
• Seamless support for existing transactional applications
• Ability to develop next generation highly scalable, real-time
transaction processing applications
• Reduction in data latency for down-steam analytic workloads
And they also gain the following benefits inherent in Hadoop
ecosystem:
• Reduced infrastructure costs
• Massive scalability and granular elasticity
• Improved data availability and disaster recovery protection
Trafodion: Transactional SQL on HBase
Architecture
The Trafodion software architecture consists of three distinct layers:
the client layer, the SQL database services layer, and the storage
engine layer as shown in the figure below.
![](http://img.blog.csdn.net/20141031135001156?watermark/2/text/aHR0cDovL2Jsb2cuY3Nkbi5uZXQvaHVhbmdnYW5nMDI4/font/5a6L5L2T/fontsize/400/fill/I0JBQkFCMA==/dissolve/70/gravity/Center)
The first layer is the Client Services layer where the application resides
and accesses the Trafodion database via standard ODBC/JDBC
interface using a Trafodion-supplied Windows or Linux client driver.
The second layer is the SQL layer where Trafodion provides a relational
schema abstraction on top of HBase, encapsulating all of the services
required for managing Trafodion database objects. These services
include connection management, transaction management, optimized
plan generation, and execution against Trafodion database objects.
Trafodion features a mature query optimizer that can generate parallel
query plans, eliminating the need for complex MapReduce programming
development.
The third layer is the Storage Engine layer which consists of standard
Hadoop services including HBase, HDFS, and Zookeeper. Trafodion
database objects are stored in native Hadoop (HBase/HDFS) database
structures. Trafodion handles the mapping of SQL requests into native
HBase calls transparently on behalf of the application.
Key innovations
Trafodion’s Distributed Transaction Management (DTM) component
provides protection to transactions spanning multiple SQL statements,
multiple tables, or multiple rows of a single table. Additionally,
Trafodion DTM provides protection in a distributed cluster configuration
across multiple HBase regions using an inherent two-phase commit
protocol. DTM provides support for implicit (auto-commit) and explicit
(BEGIN, COMMIT, ROLLBACK WORK) transaction control.
Trafodion provides many compile-time and run-time optimizations for
varying transactional workloads ranging from singleton row accesses
for OLTP-like transactions to highly complex SQL statements used for
operational reporting purposes.
Trafodion is an open source initiative from HP, incubated at HP Labs
and HP-IT, to develop an enterprise-class SQL-on-HBase solution
targeted for big data transactional or operational workloads. HP has
developed transactional SQL technologies with more than 20 years of
investment into database technology and solutions. Trafodion brings
this core technology to the Hadoop ecosystem. The name 'Trafodion'
(the Welsh word for transactions, pronounced ‘Tra-vod-eee-on':) was
chosen specifically to emphasize the differentiation that Trafodion
provides in closing a critical gap in the Hadoop ecosystem. To find out
more about the origin and the name of the project, please visit
www.hp.com/go/trafodion.
Target workloads
Hadoop workloads span from long-running batch mode to low-latency
operational workloads as shown in the figure below. The three
categories on the right side are analytic workloads and are regarded as
well-suited for Hadoop and therefore have garnered the most attention.
In contrast, the leftmost workload defined as “Operational” is a new class
of workloads that encompasses OLTP workloads as well as transactions
that include social and mobile data interactions and observations using
a mixture of structured and semi-structured data.
Traditionally, these workloads have been handled by relational
databases. But, relational databases have scalability issues and do not
provide schema flexibility required in certain cases. Hadoop addresses
these limitations. Combined with Hadoop’s perceived benefits of
significantly reduced costs, there is growing interest and pressure to
embrace these workloads in the Hadoop ecosystem.
As operational workloads represent business needs, they typically consist
of a constant flow of transactions requiring low- latency response times
for read/write access. Additionally, these workloads are characterized by:
• Data integrity with ACID-compliant protection
• High availability, concurrency and scalability
• Multi-structured data
• Rapidly evolving data requirements
Features
Currently, there is no existing open source SQL-on-HBase solution
that adequately meets these requirements. Trafodion provides the
following functionality to support transactional workloads in
Hadoop:
• ACID-compliant distributed transaction protection
over multiple SQL statements, tables and rows
• Rich, full-functioned ANSI SQL language support using
ODBC/JDBC connectivity interfaces
• Performance improvements for transactional
workloads by leveraging compile-time and run-time
optimizations
• Support for large data sets using parallel-aware
query optimizer
Trafodion intends to leverage the full capabilities of Hadoop
ecosystem:
• Schema flexibility provided by HBase column family
structures
• Snapshot capability with versioning support in Hadoop
• High Availability and Disaster Recovery support with
replication and snapshotting capabilities
Benefits
Trafodion delivers a full-featured and optimized
transactional SQL-on-HBase DBMS solution with full
transactional data protection. These capabilities help
overcome Hadoop’s weaknesses in terms of supporting
transactional workloads.
With Trafodion, customers gain the following benefits:
• Ability to leverage SQL expertise versus complex MapReduce
programming
• Seamless support for existing transactional applications
• Ability to develop next generation highly scalable, real-time
transaction processing applications
• Reduction in data latency for down-steam analytic workloads
And they also gain the following benefits inherent in Hadoop
ecosystem:
• Reduced infrastructure costs
• Massive scalability and granular elasticity
• Improved data availability and disaster recovery protection
Trafodion: Transactional SQL on HBase
Architecture
The Trafodion software architecture consists of three distinct layers:
the client layer, the SQL database services layer, and the storage
engine layer as shown in the figure below.
The first layer is the Client Services layer where the application resides
and accesses the Trafodion database via standard ODBC/JDBC
interface using a Trafodion-supplied Windows or Linux client driver.
The second layer is the SQL layer where Trafodion provides a relational
schema abstraction on top of HBase, encapsulating all of the services
required for managing Trafodion database objects. These services
include connection management, transaction management, optimized
plan generation, and execution against Trafodion database objects.
Trafodion features a mature query optimizer that can generate parallel
query plans, eliminating the need for complex MapReduce programming
development.
The third layer is the Storage Engine layer which consists of standard
Hadoop services including HBase, HDFS, and Zookeeper. Trafodion
database objects are stored in native Hadoop (HBase/HDFS) database
structures. Trafodion handles the mapping of SQL requests into native
HBase calls transparently on behalf of the application.
Key innovations
Trafodion’s Distributed Transaction Management (DTM) component
provides protection to transactions spanning multiple SQL statements,
multiple tables, or multiple rows of a single table. Additionally,
Trafodion DTM provides protection in a distributed cluster configuration
across multiple HBase regions using an inherent two-phase commit
protocol. DTM provides support for implicit (auto-commit) and explicit
(BEGIN, COMMIT, ROLLBACK WORK) transaction control.
Trafodion provides many compile-time and run-time optimizations for
varying transactional workloads ranging from singleton row accesses
for OLTP-like transactions to highly complex SQL statements used for
operational reporting purposes.
相关文章推荐
- SqlServerHelper,支持批量插入,事务,DataTable转实体,转实体类要用到的类,请查看往期随笔-- c# +泛型+自定义属性 实现 DataTable转换成实体类
- Trafodion:Transactional SQL on HBase
- c# 数据库访问静态类 支持多条SQL语句事务操作
- Trafodion——一个成熟的企业级SQL on HBase解决方案
- Trafodion成熟的SQL on HBase解决方案
- SQL事务对并发处理的支持
- maven编译ycsb0.1.4支持针对hbase性能测试,解决not a host:port pair问题,附下载地址
- 较全面的php mysql封装,使用mysqli对象支持预处理和事务,可输出执行后的sql
- 关于mybatis支持写多条sql语句,支持事务
- sql 2005存储过程触发器出现 SQLNCLI" 返回了消息 "该伙伴事务管理器已经禁止了它对远程/网络事务的支持”解决
- nodejs 针对 mysql 设计的原型库,支持事务/共享多次/单次查询
- Trafodion:Transactional SQL on HBase
- 更新DataSet,遇到“对于不返回任何键列信息的 SelectCommand 不支持 UpdateCommand 的动态 SQL 生成
- Hibernate3.0版支持经典SQL语句
- T_SQL中的事务处理
- 让MYSQL支持事务处理
- 实现数据库事务的简单方法ExecuteSqlTran()
- 对于“不返回任何键列信息的 SelectCommand 不支持 UpdateCommand 的动态 SQL 生成”问题的解决
- Sqldemo备份与恢复路径不支持空格
- MS SQL Server:事务处理概念和 MS DTC 概述 (简述)