apache solr 5.0 and beyond

17
Apache Solr - 5.0 and beyond Anshum Gupta Apache Lucene/Solr PMC Member and Committer

Upload: anshum-gupta

Post on 16-Jul-2015

1.556 views

Category:

Software


5 download

TRANSCRIPT

Apache Solr - 5.0 and beyondAnshum Gupta

Apache Lucene/Solr PMC Member and Committer

• Anshum Gupta, Apache Lucene/Solr PMC member and committer, Lucidworks Employee.

• Interested in search and related stuff.

• Apache Lucene since 2006 and Solr since 2010.

• Organizations I am or have been a part of:

Who am I?

• Apache Lucene is a free open source information retrieval software library

• Originally written in Java by Doug Cutting.

• It is supported by the Apache Software Foundation and is released under the Apache Software License.

What is Lucene?

• Solr (pronounced "solar") is an open source enterprise search platform

• Written in Java,

• For a while now, a part of the Apache Lucene project.

• Search on Lucene - Replicated (SoLR)

• SolrCloud - Distributed feature set

What is Solr?

Apache Solr is the most widely-used search solution on the planet.

Solr has tens of thousands of applications in production.

You use everyday.

8,000,000+Total downloads

Solr is both established and growing.

250,000+Monthly downloads

2,500+Open Solr jobs and the largest

community of developers.

Apache Solr is also one of the most active open source projects out there

Activity statistics

30 Day Summary Mar 14 2015 — Apr 13 2015

12 Month Summary Apr 13 2014 — Apr 13 2015

160 Commits

23 Contributors

1440 Commits

31 Contributors

Annual commits up +126 (9%) via https://www.openhub.net/p/solr

Solr Feature Release Frequency

• Search - Full text, Geo-spatial

• Faceting - Values, Ranges, Pivots, etc.

• Suggestor, highlighting, auto-complete

• Pluggability

• and of course, Speed and Scalability

Solr Essentials

Title TextWhat’s new in Solr 5x?

• Get started in < 5 minutes

• APIs, and more APIs

• Schema

• Config

• Collections

• Auto* - Failover, leader election, addition of replica!

• One of the best official documentation, released almost with the code.

Ease of Use

• Thousands of collections - Apple

• Billions of Documents - Box

• High throughput and near real time - Bloomberg

• Impressive indexing performance: 150 k docs/sec per node

Scalability and Performance

Solr Scalability is unmatched

• Tons of tests and quality code

• Critical systems running in production

• Jepsen tests - Proven again!

• Independent benchmarking and testing

Reliability

• Analytics - Do more with your data!

• Distributed IDF

• It’s an app not a war!

Features and more!

Solr News

• Scalability

• Faster search - SOLR-6810

• Improved indexing - SOLR-6816

• Analytics - HyperLogLog - SOLR-6968

• Security - Authentication and Authorization framework - SOLR-7230

• And tons more!

What’s coming?

The largest Lucene/Solr conference in the world

OCT 13 - 16, 2015 AUSTIN, TX

CFP is open until May 8, 2015

For more details visit: http://lucenerevolution.org

Connect @

http://www.twitter.com/anshumgupta

http://www.linkedin.com/in/anshumgupta/

[email protected]