r and d

Download R and D

If you can't read please download the document

Upload: kishkp

Post on 25-Sep-2015

219 views

Category:

Documents


2 download

DESCRIPTION

Big Data terms

TRANSCRIPT

SOLR - Enterprise Search Platform based on Lucene. Competition - Elastisearch, Vivisimo / Watson, Exalead etc, Marketing (developing tools to help marketing teams in SEO and SEM efforts) - Integrating with APIs (like Google Map APIs etc.) for Android, iOS and Windows / Mobile Apps for m-commerceCustomer cohortsMultivariate TestingWeb Services, SOA (Service Oriented Architecture), Graph Databases, REST APIGoogle Analytics / Web AnalyticsMarketing Analytics: Spends optimization, channel ROIsProduct analytics: User behavior, AB Testing, Design optimizationCategory analytics: Performance scorecards for categories, brands and merchantsLarge scale Data Processing: Map reduce, hadoop, hiveProgramming Language: Java, JSP /JSTL, Servlets, C/C++, ASP.net, SharePoint, MVC architecture (Spring 3.0 & Spring MVC), HadoopDatabase: SQL Server 2012Application Servers: Understanding of Tomcat, IIS Server, Apache Storm : Stream data engine - deals with each event specifically - uses Apache Zookeeper as the coordinator and not direct hadoop clustersApache Kafka : Messaging systemApache Spark : In memory distributed data analysis platform - near realtime and processes data as batches of eventsHBase / Cassandra : Non relational distriubuted DB used for sparse data. Uses NoSQL. Cassandra has a CQL that is modeled on SQL. Both are column oriented DBsOozie : Job Scheduling / workflow schedulerHue : Visualization tool / web interface for Hadoop dataAerospike : In memory noSQL DbPentaho : DW / BI toolVertica : In memory DB like Hana, Netezza, Greenplum, TeradataGraph DB Vendors : Neo4J, FlockDB by Twitter, GraphDB, Oracle Spatial and graph, Teradata Aster Document Oriented DB : Lotus Notes, MongoDB, Key Value Stores :