Solandra:基于Solr和Cassandra的实时分布式搜索引擎
2011-11-24 16:16
323 查看
Solandra,从别名上就能看出来,其实它就是结合了 Solr 与 Cassandra 的实时搜索引擎程序。
其特性如下:
支持Solr的大多数默认特性 (search, faceting, highlights)
数据复制,分片,缓存及压缩这些都由Cassandra来进行
Multi-master (任意结点都可供读写)
实时性高,写操作完成即可读到
Easily add new SolrCores w/o restart across the cluster 轻松添加及重启结点
这是来自官方的介绍:
Solandra is a real-time distributed search engine built on Apache Solr and Apache Cassandra.
At its core, Solandra is a tight integration of Solr and Cassandra, meaning within a single JVM both Solr and Cassandra are running, and documents are stored and disributed using Cassandra's data model.
Solandra makes managing and dynamically growing Solr simple(r).
For more information please see the wiki
Replication, sharding, caching, and compaction managed by Cassandra
Multi-master (read/write to any node)
Writes become available as soon as write succeeds
Easily add new SolrCores w/o restart across the cluster
From the Solandra base directory:
Now that Solandra is running you can run the demo:
Download your Cassandra distribution
Unzip it the directory of your choice
Run the following solandra ant task to deploy the necessary files into the unzipped dir
ant -Dcassandra={unzipped dir} cassandra-dist
You can now start Solr within Cassandra by using $CASSANDRA_HOME/bin/solandra command. Cassandra now takes two optional properties: -Dsolandra.context and -Dsolandra.port for the context path and the Jetty port.
http://wiki.apache.org/solr/DistributedSearch#Distributed_Searching_Limitations
其特性如下:
支持Solr的大多数默认特性 (search, faceting, highlights)
数据复制,分片,缓存及压缩这些都由Cassandra来进行
Multi-master (任意结点都可供读写)
实时性高,写操作完成即可读到
Easily add new SolrCores w/o restart across the cluster 轻松添加及重启结点
这是来自官方的介绍:
Solandra is a real-time distributed search engine built on Apache Solr and Apache Cassandra.
At its core, Solandra is a tight integration of Solr and Cassandra, meaning within a single JVM both Solr and Cassandra are running, and documents are stored and disributed using Cassandra's data model.
Solandra makes managing and dynamically growing Solr simple(r).
For more information please see the wiki
Requirements:
Java >= 1.6Features:
Supports most out-of-the-box Solr functionality (search, faceting, highlights)Replication, sharding, caching, and compaction managed by Cassandra
Multi-master (read/write to any node)
Writes become available as soon as write succeeds
Easily add new SolrCores w/o restart across the cluster
Getting started:
The following will guide you through setting up a single node instance of Solandra.From the Solandra base directory:
mkdir /tmp/cassandra-data ant cd solandra-app; bin/solandra
Now that Solandra is running you can run the demo:
cd http://www.cnblogs.com/reuters-demo ./1-download_data.sh ./2-import_data.sh While data is loading, open the file ./website/index.html in your favorite browser.
Embedding in an existing cassandra distribution
To use an existing Cassandra distribution perform the following steps.Download your Cassandra distribution
Unzip it the directory of your choice
Run the following solandra ant task to deploy the necessary files into the unzipped dir
ant -Dcassandra={unzipped dir} cassandra-dist
You can now start Solr within Cassandra by using $CASSANDRA_HOME/bin/solandra command. Cassandra now takes two optional properties: -Dsolandra.context and -Dsolandra.port for the context path and the Jetty port.
Limitations
Solandra uses Solr's built in distributed searching meachanism. Most of its limitations are covered here:http://wiki.apache.org/solr/DistributedSearch#Distributed_Searching_Limitations
相关文章推荐
- 基于Sphinx构建准实时更新的分布式通用搜索引擎平台
- 基于hadoop+nutch+solr的搜索引擎环境搭载<一>hadoop完全分布式环境搭建
- solr学习第六课---solr中facet的基本应用-基于solr搜索引擎
- solr学习第七课----solr之数据库数据导入生成索引(DataImportHandler)-基于solr搜索引擎
- 基于solr和zookeeper的分布式搜索方案
- 基于hadoop+nutch+solr的搜索引擎环境搭载<二>nutch+solr整合以及搭载在hadoop上
- 基于Docker 分布式部署solrCloud
- 基于Storm 分布式BP神经网络,将神经网络做成实时分布式架构
- 【课程分享】基于Lucene4.6+Solr4.6+Heritrix1.14+S2SH实战开发从无到有垂直搜索引擎
- 基于Cassandra的日志和分布式小文件存储系统【1】
- 一脸懵逼学习HBase---基于HDFS实现的。(Hadoop的数据库,分布式的,大数据量的,随机的,实时的,非关系型数据库)
- 跨数据库分布式实时事务 - 基于RabbitMQ实时消息队列服务实现
- 实战Solr分布式实时搜索集群搭建
- 搜索引擎-基于solrj客户端的solr增删改查 (附:大神博客链接)
- Cassandra——类似levelDB的基于p2p架构的分布式NOSQL数据库
- 分布式系统基于缓存机制的实时开关系统——可将一个指令同时推送给N个主机
- 基于Solr的淘宝商家交易数据实时查询方法
- 搜索引擎-基于solrj客户端的solr增删改查
- 基于Lucene的近实时搜索引擎优化总结