6 el tercer "por qué" (parte 1)

38
The Third “Why” Julio Faerman 2014-09-29 GX24 http://jfaerman.com.br/gx24

Upload: genexus

Post on 19-Jun-2015

468 views

Category:

Technology


0 download

DESCRIPTION

El tercer "por qué" (Parte 1)

TRANSCRIPT

  • 1. The Third Why Julio Faerman 2014-09-29 GX24 http://jfaerman.com.br/gx24

2. https://startwithwhy.com 3. Security Availability Compliance Fault Tolerance Throughput Latency 4. but what is the difference? 5. 16 years 2000+ employees 40 million user http://aws.amazon.com/solutions/case-studies/netflix/ http://www.enotechconsulting.com/2013/04/aws-s3-behind-netflix-success/ http://variety.com/2014/digital/news/netflix-youtube-bandwidth-usage-1201179643/ Amazon Web Services for 100% of Streaming 34.2% of all downstream during primetime 6. Amazon Simple Storage Service Durable, scalable and fast storage (99.999999999%) 2+ Trillion (1012) objects 1.1+ Million RPS Native HTTP/S Full featured: Permissions, Static Hosting, Logging, Versionamento, Archival and Expiration Lifecycle, Torrent, Tags, Redundancy, Requester Pays, Criptography, Reduced Redundancy and more 7. 1. Low, pay-as-you-go pricing with no up-front expenses or long-term commitments. 2. Instantly deploy new applications, scale up as your workload grows, and scale down based on demand. http://aws.amazon.com/about-aws/ 8. We will make electricity so cheap that only the rich will burn candles. Thomas Edison The Big Switch: http://amzn.com/039334522X 9. Day 1 10. http://aws.amazon.com/solutions/case-studies/ 11. Fear, Uncertainty and Doubt Topsy Elephant: https://www.youtube.com/watch?v=eh_mJfWKNTI 12. http://youtu.be/GRVPGC1haTM 13. Security Compliance Capacity Fault Tolerance Cost Complexity Billing Scalability Availability Latency Throughput 14. Proof of Concept Quantitative > Qualitative Iterative Incremental 15. http://www.infoq.com/presentations/JPL-cloud 16. JPL Missions 17. Internet of Things? 18. Batch Big Data 19. Streaming Big Data 20. How unique data systems are? http://nathanmarz.com/blog 21. 3 Interfaces to Amazon Web Services Console, CLI, SDK 22. Amazon Kinesis Real-time processing of streaming data High Throuput and Elastic Integrate with Amazon S3, Amazon Redshift, and Amazon DynamoDB Locking, Sharding, Rollback and more with Kinesis Client Library Dashboard CEP Storage 23. Amazon Elastic MapReduce Distributed processing with Apache Hadoop 24. Petabyte Scale Data Warehousing Massively parallel OnLine Analytic Processing Resizable without downtime Managed provisioning and administration Compatible with PostgreSQL Amazon Redshift 25. Amazon Redshift Architecture Leader Node SQL endpoint Stores metadata Coordinates query execution Compute Nodes Local, columnar storage Execute queries in parallel Load, backup, restore via Amazon S3; load from Amazon DynamoDB or SSH Two hardware platforms Optimized for data processing DW1: HDD; scale from 2TB to 1.6PB DW2: SSD; scale from 160GB to 256TB 10 GigE (HPC) Ingestion Backup Restore JDBC/ODBC 26. ETL from EMR/Hive to Amazon Redshift trough Amazon S3 EMR S3 Redshift Extract & Transform Load Unstructure d Unclean Structured Clean Columnar Compressed 27. 7+ Billion 28. ~50 to ~3500 Instances in 3 days 29. Amazon Auto Scaling Adjust capacity to demand 30. 280+ Releases in 2014 http://aws.amazon.com/new http://aws.amazon.com/blogs/aws 31. Where to begin? 32. http://aws.amazon.com/training/intro_series/ 33. http://aws.amazon.com/training/ 34. Julio Faerman [email protected] http://jfaerman.com.br/gx24 Thank you! Questions?