The Big Picture: Big Data for the New Wave of Analytics
DESCRIPTION
The Briefing Room with Neil Raden and MarkLogic
Live Webcast on Oct. 2, 2012
Understanding context is a critical success factor for any decision-maker. Getting a clear view of the big picture can help guide all kinds of important decisions. That's why many organizations are focused on weaving together structured and "unstructured" data to create a strategic view of enterprise issues and opportunities. The answers are usually found somewhere in between a SQL query and a Google-style search. Check out this episode of The Briefing Room to learn from veteran analyst Neil Raden of Hired Brains, who will explain how a new breed of analytical applications can generate a wide range of targeted insights. He'll be briefed by Steve Guttman of MarkLogic, who will tout his company's Enterprise NoSQL database, which combines the durability of traditional relational databases with the versatility of modern Big Data engines. He'll also discuss real-world examples of new applications for various industries.
TRANSCRIPT
Twitter Tag: #briefr
• Reveal the essential characteristics of enterprise software, good and bad
• Provide a forum for detailed analysis of today’s innovative technologies
• Give vendors a chance to explain their product to savvy analysts
• Allow audience members to pose serious questions... and get answers!
• November: Cloud
• December: Innovators
• January: Big Data
• February: Performance
• March: Integration
• Traditionally, databases have been built around SQL, a declarative query language targeted at organizing data in two-dimensional tables.
• The ever-increasing variety, volume and velocity of data has taxed traditional relational databases and created performance bottlenecks, particularly around CPU, memory, disk I/O and network saturation.
• Non-relational or NoSQL alternatives have emerged to better support extreme and diverse workloads without suffering hits in performance, while the incumbents increasingly add capabilities to stay competitive.
Neil Raden is the founder and Principal Analyst at Hired Brains Research. He is the co-author, with James Taylor, of “Smart (Enough) Systems: How To Deliver Competitive Advantage by Automating Hidden Decisions,” Prentice Hall, 2007. With 30 years of experience, he is a widely published writer, well-known speaker, analyst and consultant, having personally designed and implemented dozens of large analytical applications in finance, marketing, distribution, logistics, actuarial, intelligence, scientific, statistical and consumer products. As an industry analyst, he has published over 40 white papers and hundreds of articles, blogs and research reports. He welcomes your comments and can be reached at [email protected].
• MarkLogic offers a schema-agnostic, enterprise-grade NoSQL database technology.
• Version 6 includes integration with existing BI tools; REST and Java APIs; JSON, Search and Visualization enhancements; and in-database MapReduce and Analytics.
• Its shared-nothing architecture enables systems to scale linearly as demand increases, and allows customers the flexibility and extensibility to quickly tap into information assets.
Steve Guttman is MarkLogic’s VP of Product Management. He has a broad background in Software Marketing and Product Management for Client, Web and Enterprise markets. Steve helped develop several groundbreaking products, including Adobe Photoshop and the first browser-based spreadsheet. Most recently, Steve worked as Microsoft’s Product Unit Manager (the product GM) for Expression Web – a Web authoring tool created for Front-End Web developers. At Autodesk, he oversaw Product Management for the GIS and Civil Engineering division, and helped launch Autodesk Civil 3D – a next-generation model-based design tool for civil engineers, which quickly grew to be an industry-leading product. In 1999 Steve started Halfbrain.com, the first company to build and launch a browser-based spreadsheet and presentation program – over 6 years before Google Docs made its debut. Halfbrain.com merged with BI vendor AlphaBlox (now part of IBM), where the web spreadsheet became a rich front end to Essbase and SQL Server Analysis Services.
Keeping in Context: A New Generation of Business Applications Steve Guttman, VP Product Management October 2, 2012
Slide 10 Copyright © 2012 MarkLogic® Corporation. All rights reserved.
Thank you for joining us
Agenda
• How Organizations Get Value From Big Data
• The NoSQL Difference
• Enterprise Ready Capabilities
• Q&A
Real Value From Big Data
Create New Revenue Streams
Gain Insights to Increase Market Share
Make The World More Secure
Reduce Bottom Line Expense
Provide Access To Valuable Information
The County of Fairfax delivers on their promise of open government.
[Diagram labels: Oracle; Mainframe; Word; CAD; 30,000 documents; Transformation; Human & Machine Enrichment]
Goals
Make it easier to access real-time information about zoning changes, land ordinances & property history.
Solution
Fairfax County uses MarkLogic as its secure, all-source repository with easy-to-use search, including a self-service web portal.
Benefits
Fast development process – live in 2 months. Lower system costs – shut down mainframes. Better information faster.
“Big Investment Bank” ensures their customers have newest, most up-to-date research.
Goals
Monetize their research, and gain competitive advantage with “first-to-publish” information.
Challenges
Solution
Equity research authoring and delivery solution (replaced Oracle & FAST) that brings search, scalability, ease of use and speed to mobile and alerting applications.
Benefits
Decreased development time by 88% (12 months to 3 weeks). Faster response time to hot topics has increased readership. Saved 50% over previous technology platform.
Search, insert, track, tag, all from within familiar authoring tools
The FAA enhances airline safety with real-time monitoring & dashboards.
[Diagram labels: Weather reports; Social media; Existing FAA systems; Consolidated data for real-time monitoring]
Goals
Monitor and track emergencies and severe weather, and update a real-time dashboard for crisis management.
Challenges
Solution
Repository for multi-source, complex, multi-structured data; supports geospatial enrichment, faceted navigation & search; integrated with Google Earth, Microsoft SharePoint & internal systems.
Benefits
Proof of concept complete in 2 weeks. Entire system went from purchase to Beta in 6 months. Able to deliver mission-critical system that had previously failed on RDBMS.
We help the BBC team achieve and celebrate their summer of success.
Goals
Make London 2012 the “most social” Olympics on record.
Challenges
Solution
The system flexibly stores all data in a single repository, from scores to player bios to team history & tweets, and makes all new data available in real time.
Benefits
BBC Sports data systems were relaunched in just a few months, earning critical praise for innovative features and the speed to add new sources.
Eliminate Relational Pain
The Relational Way
New data source = New schema
Search limited set of data
Transform data, force structure
Weeks to build new queries or add/change attributes
Wait for development resources to build new applications
The NoSQL Way
New data source = No problem
Search all of your data
Use data as is
Have real-time access to information
Rapid prototyping application development tools & APIs
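As a rough illustration of "New data source = No problem" and "Use data as is", the sketch below stores differently shaped records side by side and runs one text search across all of them without a shared schema. It is plain Python, not MarkLogic's API; the sample documents and the `flatten_text` and `search` helpers are hypothetical.

```python
# Hypothetical documents with different shapes (illustrative data only).
documents = [
    {"type": "score", "event": "100m final", "winner": "Athlete A", "time_s": 9.9},
    {"type": "bio", "name": "Athlete A", "country": "Examplia"},
    {"type": "tweet", "user": "fan42", "text": "What a race in the 100m final!"},
]


def flatten_text(value):
    """Yield every scalar inside a nested dict/list as a lowercase string."""
    if isinstance(value, dict):
        for v in value.values():
            yield from flatten_text(v)
    elif isinstance(value, list):
        for v in value:
            yield from flatten_text(v)
    else:
        yield str(value).lower()


def search(docs, term):
    """Return documents where any field contains the term, whatever their structure."""
    term = term.lower()
    return [d for d in docs if any(term in text for text in flatten_text(d))]


hits = search(documents, "100m final")
print([d["type"] for d in hits])  # the score record and the tweet both match
```

Adding a fourth kind of record here means appending another dictionary; nothing about the existing records or the search function has to change, which is the contrast with "New data source = New schema".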
Multi-Lingual, Full Text & GeoSpatial Search
Full-Text Search Capabilities You Expect
• Supports 200+ languages
Advanced Capabilities for More Value
• Proximity boosting & distinctive terms
• Phrase-through and phrase-around
• Lexicon, custom dictionary, thesaurus support
Location Awareness in Content
• Native integration of full-text and geospatial information
Geospatial Visualization
• Integrates with ESRI, Google Earth, Google Maps, Yahoo Maps, Microsoft Live Search Maps
• Supports GML, KML, GeoRSS
Benefits
• More valuable answers with complete view of data
• Get information in real-time
• High performance search API
• Faceted navigation
• Supports billions of documents
• Use visual maps to help users refine results
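Faceted navigation, listed among the benefits above, can be pictured as counting the distinct values of a field across the current result set so that users can narrow results by picking a value. A minimal Python sketch, using made-up documents and hypothetical `facet_counts` and `refine` helpers (this is not MarkLogic's search API):

```python
from collections import Counter

# Illustrative result set; documents need not share the same fields.
results = [
    {"title": "Zoning change 12", "category": "zoning", "year": 2011},
    {"title": "Land ordinance 7", "category": "ordinance", "year": 2012},
    {"title": "Zoning change 15", "category": "zoning", "year": 2012},
    {"title": "Scanned memo"},  # no category or year
]


def facet_counts(docs, field):
    """Count how many documents carry each value of the given field."""
    return Counter(d[field] for d in docs if field in d)


def refine(docs, field, value):
    """Keep only documents whose field equals the chosen facet value."""
    return [d for d in docs if d.get(field) == value]


print(facet_counts(results, "category"))
zoning = refine(results, "category", "zoning")
print(facet_counts(zoning, "year"))
```

Each refinement step recomputes the counts on the narrowed set, which is what lets a sidebar of facets stay in sync with the results a user is looking at.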
BI Tools Support
Big Data Analysis for mere mortals
• Perform sophisticated analytics on ALL your data, in real-time
Leveraging existing tools avoids added expenditures
• No additional training
• Reuse existing report templates
• No custom integration or code required
Simplify IT infrastructure
• Integrated through ODBC – tested on IBM Cognos & Tableau
• No need to spend resources on extracting data to a data warehouse
• Real-time access to your operational database – gain competitive advantage
Benefits
• Makes your data more accessible across your business
• Faster results
• Faster ROI
Application Builder & Visualization Widgets
Quick Development of Visual Interfaces
• Easily build or enhance web applications using MarkLogic tools and widgets
• Simplify front-end architectures with pre-configured visualizations and data access helpers
Explore New Ways to Look at Data
• Gain more insights from data sets with visual representations
• Charts, Maps, Search, Results, Sidebar with facets
Benefits
• Simplifies development
• Speeds time-to-deployment
• Easy access to powerful geospatial and aggregation capabilities
• Scalable, event-driven, encapsulated
Ensure You Don’t Create New Pain
Only Enterprise NoSQL Database
• ACID compliant
• Big data search
• High availability
• Replication
• Point-in-time recovery
• Government-grade security
• Real-time your Hadoop
• Real-time Big Data analytics
Database Search
Application Services
MarkLogic: Powerful. Accessible. Trusted.
100s of Use Cases in Production and Development
• Competitive Intelligence
• Compliance
• Content Delivery
• Counterterrorism
• Digital Asset Management
• Fraud Detection
• Reference Data Management
• Search & Discovery
• Social Media Analysis
Any Questions?
Hired Brains is an independent firm providing research and vendor advisory services and direct-to-client consulting for 25 years
Neil Raden
CEO and Founder, Hired Brains
[email protected]
Twitter: @neilraden
Blog: http://hiredbrains.wordpress.com
http://www.linkedin.com/in/neilraden
Copyright © 2010-2012 Hired Brains Inc. 25
MarkLogic, NoSQL and Big Data
• Relational database reign of 30 years is being challenged
• Needs for distributed document-centric stores
• Web-level data is not economical with existing RDBMS structure or licensing
• Wide variety in NoSQL such as XML, graph, column, object and in-memory and Hadoop
• MarkLogic is a distributed document-oriented XML system
Data Doesn’t Speak for Itself
• Data is only a proxy for reality, footprints left behind
• Its meaning has to be understood
• Data integration provides meaning and context to data
• Happens in logical locations
• Analytics gives value to Big Data
Willie Sutton: Infamous Bank Robber
Q: Willie, why do you rob banks?
A: Because that’s where the money is.
• Just a technical question: how does all of that mixed media data become XML data during the ingestion phase?
• A follow-up to that question: MarkLogic is praised for its amazing query response time, but how fast is the ingestion process to a disk-based forest?
• We hear a lot about Big Data (too much, really), but despite the fact that much of the data is not traditional structured data, a lot of the analytics eventually resemble the quantitative analytics that preceded it. Because MarkLogic positions itself as a Big Data player, how does it support the deep quantitative analytics of Big Data?
• I read a case study which mentions that MarkLogic provided scores and data visualization for the London Olympics. How does that work?
• MarkLogic describes its strength as providing context to decision-making. Apparently, you partner with traditional BI vendors like Cognos. How do you provide context to a BI tool?
• How expensive is MarkLogic? I understand you can build a cluster of the same commodity servers as Hadoop, but with a lot of servers, doesn’t the license cost become prohibitive in the same way most NoSQL vendors point out the cost of traditional relational databases?
• I get the sense that the applications for MarkLogic have a strong feel for content understanding and search, but all organizations still strive to understand their operations in traditional data warehousing and BI methods. Can MarkLogic support this without the other infrastructure?
• This Month: Database
• November: Cloud
• December: Innovators
• January: Big Data
• 2013 Editorial Calendar (www.insideanalysis.com)