Abstract
Many image filtering techniques can be accelerated with compute unified device architecture (CUDA)-based massively parallel implementations. In this paper, we show the major issues on our acceleration techniques, and also its implementation details. We implemented various image filtering operations in our own CUDA kernel programs, and they are combined to build an artifact-detection scheme in PCB board soldering process. We designed our own computational logics on the artifact detection, and finally, the whole system figure out the potential artifact regions. Comparing with the central processing unit-based reference implementation, we show its correctness and feasibility, with much execution speed-ups.
Original language | English |
---|---|
Pages (from-to) | 749-755 |
Number of pages | 7 |
Journal | Cluster Computing |
Volume | 20 |
Issue number | 1 |
DOIs | |
State | Published - 1 Mar 2017 |
Keywords
- Acceleration
- Artifact detection
- CUDA
- Image filter
- Practical solution