exploring data science - meetupfiles.meetup.com/19227507/trends in data science.pdf · exploring...
TRANSCRIPT
PLEASE NOTE: (1) Light on code this month!
(2) Images and some text has been borrowed from the inter webs. Apologies if I did not credit. Thanks for the info, no $$ were made but please be comforted in the fact that you’re making the world a smarter place!!
…DISCLAIMERS…
Machine LearningMachine learning is a type of artificial intelligence (AI) that provides computers with the ability to learn without being explicitly programmed.
Machine learning focuses on the development of computer programs that can teach themselves to grow and change when exposed to new data. source: whatis.techtarget.com/definition/machine-learning
Machine Learning Trends
Categories:
Algorithms
Data Storage
Languages
According to KD Nuggets:
Algorithms
According to Quora via Data Science Central:
Those top-10 algorithms for reference:1 C4.5 (Decision tree)2 k-Means3 Support Vector Machines (SVM)4 Apriori (Association rule learning)5 Expectation Maximization (EM)6 PageRank7 AdaBoost8 k-Nearest Neighbors (kNN)9 Naive Bayes10 Classification and Regression Tree
(CART/MART)
Algorithms
According to Kaggle:
• Decision Trees • Naive Bayes • Least Squares
Regression • Logistic Regression • Ensemble Methods • Neural Networks
Algorithms
Why do I keep hearing about deep learning and neural networks?
New buzzwords maybe?
The cool kids are all doing it…Google TensorFlow
IBM Watson
Apple Accelerate
Popularity by AWS:
1. S3 (Simple Storage Service)2. Glacier (Archival Storage)3. EBS (Elastic Block Store - persistent)4. EC2 (Instance Storage - temporary)5. Storage Gateway6. RDS (Relational)7. DynamoDB (NoSQL)8. SQS (Simple Queue Service)9. Elasti-Cache (Caching Service)10. Redshift (BI)
Data Storage
Popularity by discussion on Stack Overflow:
Data Storage
db-engines.com:
Data Storage
NoSQL Only
Stack Overflow Developer Survey for Data & Math
Languages
According to GitHub
Languages
KD Nuggets - Data Science
Languages
…So, what’ does it mean?
…IMHO…
1. Trends on storage and applications to cloud providers
2. Python & R for Data Scientists, Javascript remains strong
3. Open source products used for innovation & startups
4. Established products for established
5. Specific use technology popular in niches
Sources• whatis.techtarget.com/definition/machine-
learning• http://www.kdnuggets.com/images/top-10-
algorithms-data-scientists-used.jpg• https://www.quora.com/What-are-the-top-10-
data-mining-or-machine-learning-algorithms• http://www.datasciencecentral.com/profiles/
blogs/top-10-machine-learning-algorithms• http://www.kdnuggets.com/2016/08/10-
algorithms-machine-learning-engineers.html• http://www.kdnuggets.com/2015/12/harasymiv-
lessons-kaggle-machine-learning.html• http://playground.tensorflow.org• http://www.thegeekstuff.com/2016/02/aws-
storage-and-db/• http://db-engines.com/en/ranking• http://stackoverflow.com/questions/1270321/a-
full-list-of-all-the-new-popular-databases-and-their-uses
• http://stackoverflow.com/research/developer-survey-2016
• http://www.techworm.net/2016/09/top-10-popular-programming-languages-github.html
• https://github.com/blog/2047-language-trends-on-github
• http://www.kdnuggets.com/2016/06/big-data-science-deep-learning-software-associations.html
Thank you for coming!
if you have additional questions, please feel free to reach out:
[email protected] @RandallShanePhD