weekly report start learning gpu ph.d. student: leo lee date: sep. 18, 2009

25

Weekly Report Start learning GPU Ph.D. Student: Leo Lee date: Sep. 18, 2009

Post on 20-Dec-2015

216 views

Category:

Documents

0 download

Report

Download

Tags:

Embed Size (px):

TRANSCRIPT

Page 1: Weekly Report Start learning GPU Ph.D. Student: Leo Lee date: Sep. 18, 2009

Weekly ReportStart learning GPU

Ph.D. Student: Leo Leedate: Sep. 18, 2009

Page 2: Weekly Report Start learning GPU Ph.D. Student: Leo Lee date: Sep. 18, 2009

Outline

• References

• CUDA

• Work plan

Page 3: Weekly Report Start learning GPU Ph.D. Student: Leo Lee date: Sep. 18, 2009

Outline

• References

• CUDA

• Work plan

Page 4: Weekly Report Start learning GPU Ph.D. Student: Leo Lee date: Sep. 18, 2009

References

Page 5: Weekly Report Start learning GPU Ph.D. Student: Leo Lee date: Sep. 18, 2009

Frequent itemset mining on graphics

• Introduction– Two representative algorithms: Apriori and FP-growth;

• FP-growth were generally faster than Apriori;• Apriori-borgelt was slightly faster when the support was high;

– No prior work focuses on studying the GPU acceleration for FIM algorithms.

– Challenge: the data structure is not aligned and access patterns are not regular (pointer-chasing).

Page 6: Weekly Report Start learning GPU Ph.D. Student: Leo Lee date: Sep. 18, 2009

Frequent itemset mining on graphics

• Background and related work-GPGPU

– The parallel primitives [19] are a small set of common operations exploiting the architectural features of GPUs. We utilize map, reduce, and prefix sum primitives in our two FIM implementations.

– Improvement - Memory optimizations: • Local memory optimization for temporal locality• Coalesced access optimization of device memory for spatial locality• The built-in vector data type to reduce the number of memory access.

– Difference• we study the GPU acceleration of Apriori for FIM, which incurs much more

complex control fows and memory accesses than performing database joins or maintaining quantiles from data streams.

Page 7: Weekly Report Start learning GPU Ph.D. Student: Leo Lee date: Sep. 18, 2009

Page 8: Weekly Report Start learning GPU Ph.D. Student: Leo Lee date: Sep. 18, 2009

Frequent itemset mining on graphics

• Implementation

Page 9: Weekly Report Start learning GPU Ph.D. Student: Leo Lee date: Sep. 18, 2009

Frequent itemset mining on graphics

• Implementation

Page 10: Weekly Report Start learning GPU Ph.D. Student: Leo Lee date: Sep. 18, 2009

Frequent itemset mining on graphics

• Implementation-Pure Bitmap Implementation

Page 11: Weekly Report Start learning GPU Ph.D. Student: Leo Lee date: Sep. 18, 2009

Frequent itemset mining on graphics

• Implementation-PBIGiven m frequent (K ¡1)-itemsets, and n items. In order to check whether one (K ¡ 1)-itemset is frequent, we need to access (logm*(n/128)*16) bytes of data, where logm is the cost of performing a binary search, and (n/128)*16 is the size of a row (in bytes) in the bitmap of (K¡1)-itemsets. Typically, if m = 10000 and n = 10000, we need to access about 16 KB for checking only one (K ¡ 1)-subset. This problem in our pure bitmap- based solution triggers us to consider adopting another data structure in the Candidate Generation procedure in the presence of a large number of items.

Page 12: Weekly Report Start learning GPU Ph.D. Student: Leo Lee date: Sep. 18, 2009

Frequent itemset mining on graphics

• Implementation-Trie based ImplemetationThe candidate generation based on trie traversal is implemented on the CPU. This decision is based on the fact that, the trie is an irregular structure and difficult to share among SIMD threads. Thus, we store the trie representing itemsets in the CPU memory, and the bitmap representation of transactions in the GPU device memory.

Page 13: Weekly Report Start learning GPU Ph.D. Student: Leo Lee date: Sep. 18, 2009

Frequent itemset mining on graphics

• Implementation-TBI

Page 14: Weekly Report Start learning GPU Ph.D. Student: Leo Lee date: Sep. 18, 2009

Frequent itemset mining on graphics

• Experiments

Page 15: Weekly Report Start learning GPU Ph.D. Student: Leo Lee date: Sep. 18, 2009

Frequent itemset mining on graphics

• Experiments

Page 16: Weekly Report Start learning GPU Ph.D. Student: Leo Lee date: Sep. 18, 2009

Frequent itemset mining on graphics

• Results

Page 17: Weekly Report Start learning GPU Ph.D. Student: Leo Lee date: Sep. 18, 2009

Frequent itemset mining on graphics

• Results

Page 18: Weekly Report Start learning GPU Ph.D. Student: Leo Lee date: Sep. 18, 2009

Frequent itemset mining on graphics

• Results

Page 19: Weekly Report Start learning GPU Ph.D. Student: Leo Lee date: Sep. 18, 2009

Frequent itemset mining on graphics

• Results

Page 20: Weekly Report Start learning GPU Ph.D. Student: Leo Lee date: Sep. 18, 2009

Outline

• References

• CUDA

• Work plan

Page 21: Weekly Report Start learning GPU Ph.D. Student: Leo Lee date: Sep. 18, 2009

CUDA

• Review the code of K-means – CPU: 1101 S (10 S)– GPU: still need debug, no results right now

Page 22: Weekly Report Start learning GPU Ph.D. Student: Leo Lee date: Sep. 18, 2009

Outline

• References

• CUDA

• Work plan

Page 23: Weekly Report Start learning GPU Ph.D. Student: Leo Lee date: Sep. 18, 2009

Work Plan

• Summary this month

• Make plan for next month

• Try to implement a data mining algorithm

• Homework

Page 24: Weekly Report Start learning GPU Ph.D. Student: Leo Lee date: Sep. 18, 2009

References

Key words Google scholar ACM portal

GPU decision tree 2,230 222

GPU k-means 388 184

GPU SVM 416 27

GPU Apriori 1,980 11

GPU Expectation Maximization

266 24

GPU PageRank 4,260 5

GPU AdaBoost 113 9

GPU k-nn 314 20

GPU Naive Bayes 104 2 (false positive)

GPU CART 1,040 3 (false positive)

Page 25: Weekly Report Start learning GPU Ph.D. Student: Leo Lee date: Sep. 18, 2009

• Thanks for your listening

Computer Vision on GPU with OpenCV - Gipsa- · PDF fileComputer Vision on GPU with OpenCV ... •OpenCV GPU module •Face Detection on GPU •Pedestrian detection on GPU 2 . ... C++

GPU-SM: Shared Memory Multi-GPU Programmingimpact.crhc.illinois.edu/shared/Papers/gpusm_gpgpu8.pdf · GPU-SM: Shared Memory Multi-GPU Programming Javier Cabezas Barcelona Supercomputing

20150225 alea gpu professional .net gpu development

PacketShader - SIGCOMMconferences.sigcomm.org/sigcomm/2010/slides/S6Han.pdf2010 Sep. PacketShader: A GPU-Accelerated Software Router SangjinHan † In collaboration with: KeonJang

Efficient GPU Programming in Modern C++€¦ · Efficient GPU Programming in Modern C++ Gordon Brown Principal Software Engineer, SYCL & C++ CppCon 2019 –Sep 2019

WT-4065, Superconductor: GPU Web Programming for Big Data Visualization, by Leo Meyerovich and Matthew Torok

GPU Architecture: Implications & Trendss08.idav.ucdavis.edu/luebke-nvidia-gpu-architecture.pdf · GPU Architecture: Implications & Trends ... GPU architecture increasingly centers

GPU-to-GPU and Host-to-Host Multipattern String Matching on a GPU

Cygnus: GPU meets FPGA for HPC - RIKEN R-CCS · 2020. 2. 27. · FPGA-GPU DMA (FPGA ← GPU) FPGA-GPU DMA (FPGA → GPU) direction via CPU FPGA-GPU DMA GPU→FPGA 17 1.44 FPGA→GPU

GPU-Based Hierarchical Texture Decompression short2staff.elka.pw.edu.pl/.../GPU-Based_Hierarchical_Texture_Decompression.pdf · GPU-Based Hierarchical Texture Decompression J. Stachera

LEO Fou ndationleo-foundation.org/wp-content/uploads/2018/03/LEO-Foundation... · ANNUAL REPORT 2017 – LEO FOUNDATION 4 About LEO Foundation About LEO Foundation – The LEO Foundation

LEO/LEO XXL VFD - spiderstaging.com · 800-644-6478 | PO Box 2750 Melbourne | FL 32902 | LEO/LEO XXL VFD Leo 3 Phase: 50-2 | Leo 1 Phase: 50-40 | Leo XXL: 50-25

PyCUDA: Even Simpler GPU Programming with Python · Python Code GPU Code GPU Compiler GPU Binary GPU Result Machine Human In GPU scripting, GPU code does not need to be a compile-time

GPU Architecture & Implications - Computer Scienceskadron/cuda_asplos08_tutorial/4-GPU-architecture.pdf · GPU Architecture CUDA provides a parallel programming model The Tesla GPU

CMPT454 GPU Managed Database · GPGPU: General Purpose GPU, using GPU for usual CPU usage. Outline 1. GPU VS CPU 2. GPU Implementation 3. Products 4. Future Holds. GPU VS CPU. GPU

2010 Sep. PacketShader: A GPU-Accelerated Software Router Sangjin Han † In collaboration with: Keon Jang †, KyoungSoo Park ‡, Sue Moon † † Advanced Networking

Optimizing GPU to GPU Communication on Cray XK7

GPU Computing with MATLAB - GPU Technology Conference

LEO/LEO XXL - spiderstaging.com · 800-644-6478 | PO Box 2750 Melbourne | FL 32902 | LEO/LEO XXL Leo 3 Phase: 50-2 | Leo 1 Phase: 50-40 | Leo XXL: 50-25 H 110/400ft W 1000/2000lbs

OpenCV on a GPU · OpenCV GPU header file Upload image from CPU to GPU memory Allocate a temp output image on the GPU Process images on the GPU Process images on the GPU Download

Multi-GPU Programming - GPU Technology Conference

GPU Acceleration for Seismic Interpretation Algorithmsdeveloper.download.nvidia.com/GTC/...GTC2012-GPU-Acceleration-Sei… · GPU Acceleration for Seismic ... GPU Acceleration for

Leo 3W, Leo 4W - Invacare

GPU Architecture and Programming. GPU vs CPU

Leo Club Of Rit Activities Pictures Sep 2009

Extending Unified Parallel C for GPU Computing · PGAS Programming Model for Hybrid Multi-Core Systems Computer Node CPU Memory GPU GPU Memory CPU CPU GPU GPU Memory Computer Node

Taming GPU Threads with F# and Alea GPU · Taming GPU Threads with F# and Alea GPU ... Alea Reactive Dataflow ... 20141105_Taming GPU threads with Fshap

GPU, GP-GPU, GPU computing

GPU Physics - Nvidiadeveloper.download.nvidia.com/.../siggraph/gpu_physics-siggraph-06.pdf · NVIDIA GPU Physics Multi-GPU configurations, mixed or same GPU type One GPU does both

Boston Fire Department Firefighting Force Personnel ......Madden Joseph Leo Lieutenant Feb 1, 1924 Sep 1, 1954 Maddock Norman Leo Fire Fighter Mar 12, 1947 Feb 1, 1979 Madigan John

ICMP Process Flowcharts (14x20) Final Version 2 (Sep 2019) v3 · FLOWCHART A INITIAL CONTACT WITH CICI- FOR USE OF LAW ENFORCEMENT OFFICER (LEO) Child is 15 YRS OLD or below LEO:

Build GPU Cluster Hardware for Efficiently Accelerating ... · Hardware GPU dense HPC Cluster CPU Host RAM GPU #5 PHB IB Card Host RAM CPU GPU #0 PHB IB Card GPU #1 GPU #4 GPU #2

GPU Benefits for Earth System Science › sites › default › files... · GPU Benefits for Earth System Science. 2 TOPICS ... 19km GPU 19km CPU 1.9km GPU.93km GPU ... NOAA FV3 GPU

GPU Computing with Matlab® @ CBI Laboratory. Overview GPU History & Hardware – GPU History – CPU vs. GPU Hardware – Parallelism Design Points GPU Software

Sep 11, 2009 Automatic Transformation of Applications onto GPUs and GPU Clusters PhD Candidacy presentation: Wenjing Ma Advisor: Dr Gagan Agrawal The