Yasuaki Itou
Last Updated :2025/04/03
- Affiliations, Positions
- Graduate School of Advanced Science and Engineering, Professor
- E-mail
- yasuaki
hiroshima-u.ac.jp
- Self-introduction
- High-performance computing, Parallel computing, AI, Machine learning, Quantum computing, Quantum chemistry calculations, Embedded systems.
Basic Information
Academic Degrees
- Doctor of Engineering, Hiroshima University
- Master of Information Science, Japan Advanced Institute of Science and Technology\, Hokuriku
Research Fields
- Informatics;Computing Technologies;Software
Research Keywords
- Parallel processing
- Reconfigurable computing
- GPGPU
- FPGA
Educational Activity
Course in Charge
- 2025, Graduate Education (Master's Program) , 2Term, Special Exercises on Informatics and Data Science A
- 2025, Graduate Education (Master's Program) , 3Term, Special Exercises on Informatics and Data Science B
- 2025, Graduate Education (Master's Program) , 4Term, Special Exercises on Informatics and Data Science B
- 2025, Graduate Education (Master's Program) , 1Term, Special Exercises on Informatics and Data Science B
- 2025, Graduate Education (Master's Program) , 2Term, Special Exercises on Informatics and Data Science B
- 2025, Graduate Education (Master's Program) , Academic Year, Special Study on Informatics and Data Science
- 2025, Graduate Education (Master's Program) , 1Term, Embedded System
- 2025, Graduate Education (Doctoral Program) , Academic Year, Special Study on Informatics and Data Science
- 2025, Liberal Arts Education Program1, 3Term, Starting Programming from Scratch
- 2025, Undergraduate Education, First Semester, Programming III
- 2025, Undergraduate Education, 3Term, Operating Systems
- 2025, Undergraduate Education, 2Term, Informatics and Data Science Exercise II
- 2025, Undergraduate Education, 3Term, Informatics and Data Science Exercise III
- 2025, Undergraduate Education, 4Term, Informatics and Data Science Exercise IV
- 2025, Undergraduate Education, 1Term, Computer Science Seminar I
- 2025, Undergraduate Education, 2Term, Computer Science Seminar II
- 2025, Undergraduate Education, Second Semester, Graduation Thesis
- 2025, Graduate Education (Master's Program) , 1Term, Special Exercises on Informatics and Data Science A
Research Activities
Academic Papers
- FM screening by the local exhaustive search, with hardware acceleration, INTERNATIONAL JOURNAL OF FOUNDATIONS OF COMPUTER SCIENCE, 16(1), 89-104, 200502
- An energy efficient leader election protocol for radio network with a single transceiver, IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, E89A(5), 1355-1361, 200612
- Efficient hardware algorithms for N choose K counters using the bitonic merger, INTERNATIONAL JOURNAL OF FOUNDATIONS OF COMPUTER SCIENCE, 18(3), 517-528, 200706
- A NEW FM SCREENING METHOD TO GENERATE CLUSTER-DOT BINARY IMAGES USING THE LOCAL EXHAUSTIVE SEARCH WITH FPGA ACCELERATION, INTERNATIONAL JOURNAL OF FOUNDATIONS OF COMPUTER SCIENCE, 19(6), 1373-1386, 200812
- LOW-LATENCY CONNECTED COMPONENT LABELING USING AN FPGA, INTERNATIONAL JOURNAL OF FOUNDATIONS OF COMPUTER SCIENCE, 21(3), 405-425, 201006
- AN EFFICIENT PARALLEL SORTING COMPATIBLE WITH THE STANDARD QSORT, INTERNATIONAL JOURNAL OF FOUNDATIONS OF COMPUTER SCIENCE, 22(5), 1057-1071, 201108
- A Graph Rewriting Approach for Converting Asynchronous ROMs into Synchronous Ones, IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, E94D(12), 2378-2388, 201112
- A GPU Implementation of Dynamic Programming for the Optimal Polygon Triangulation, IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, E96D(12), 2596-2603, 201312
- Offline Permutation Algorithms on the Discrete Memory Machine with Performance Evaluation on the GPU, IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, E96D(12), 2617-2625, 201312
- A Classification Processor for a Support Vector Machine with embedded DSP slices and block RAMs in the FPGA, in Proc. of the IEEE 7th International Symposium on Embedded Multicore SoCs (MCSoC), 91-96, 201309
- A Flexible-Length-Arithmetic Processor Using Embedded DSP Slices and Block RAMs in FPGAs, in Proc. of International Symposium on Computing and Networking (CANDAR), 75-84, 201312
- Accelerating computation of Euclidean distance map using the GPU with Efficient memory access, International Journal of Parallel, Emergent and Distributed Systems, 28(5), 383-406, 2013
- An Efficient Implementation of the Hough Transform using DSP slices and block RAMs on the FPGA, in Proc. of the IEEE 7th International Symposium on Embedded Multicore SoCs (MCSoC), 85-90, 201309
- An FPGA implementation for neural networks with the FDFM processor core approach, International Journal of Parallel, Emergent and Distributed Systems, 28(4), 308-320, 2013
- An Optimal Offline Permutation Algorithm on the Hierarchical Memory Machine, with the GPU implementation, in Proc. of 2013 International Conference on Parallel Processing (ICPP), 1-10, 20131001
- ASCII Art Generation using the Local Exhaustive Search on the GPU, in Proc. of International Symposium on Computing and Networking (CANDAR), 194-200, 201312
- Efficient Hough Transform on the FPGA using DSP slices and Block RAMs, in Proc. of Workshop on Advances in Parallel and Distributed Computational Models (APDCM), 771-778, 20130501
- Implementations of the Hough Transform on the Embedded Multicore Processors, International Journal of Networking and Computing (IJNC), 4(1), 174-188, 20140101
- Template Matching using DSP slices on the FPGA, in Proc. of International Symposium on Computing and Networking (CANDAR), 338-344, 201312
- The Approximate String Matching on the Hierarchical Memory Machine, with Performance Evaluation, in Proc. of the IEEE 7th International Symposium on Embedded Multicore SoCs (MCSoC), 79-84, 201309
- The Random Address Shift to Reduce the Memory Access Congestion on the Discrete Memory Machine, in Proc. of International Symposium on Computing and Networking (CANDAR), 95-103, 201312
- TinyCSE: Tiny Computer System for Education, in Proc. of International Symposium on Computing and Networking (CANDAR), 639-641, 201312
- Offline Permutation on the CUDA-enabled GPU, IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, E97D(12), 3052-3062, 201412
- An Optimal Implementation of the Approximate String Matching on the Hierarchical Memory Machine, with Performance Evaluation on the GPU, IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, E97D(12), 3063-3071, 201412
- Efficient Exhaustive Verification of the Collatz Conjecture using DSP blocks of Xilinx FPGAs, International Journal of Networking and Computing, 1(1), 49-62, 201101
- An RSA Encryption Hardware Algorithm using a Single DSP Block and a Single Block RAM on the FPGA, International Journal of Networking and Computing, 1(2), 277-289, 201107
- Accelerating the CKY parsing using FPGAs, IEICE Transactions on Information and Systems, E86-D(5), 803-810, 201312
- Instance-Specific Solutions to Accelerate the CKY Parsing for Large Context-free Grammars, International Journal on Foundations of Computer Science, 15(2), 403-416, 200404
- Implementations of a Parallel Algorithm for Computing Euclidean Distance Map in Multicore Processors and GPUs, International Journal of Networking and Computing, 1(2), 260-276, 201107
- The Parallel FDFM Processor Core Approach for CRT-based RSA Decryption, International Journal of Networking and Computing, 2(1), 79-96, 201201
- An Algorithm to Obtain Circuits with Synchronous RAMs, Journal of Communication and Computer, 9(5), 547-559, 201212
- A Rewriting Approach to Replace Asynchronous ROMs with Synchronous Ones for the Circuits with Cycles, International Journal of Networking and Computing, 2(1), 269-290, 201207
- Accelerating ant colony optimisation for the travelling salesman problem on the GPU, International Journal of Parallel, Emergent and Distributed Systems, 29(4), 401-420, 20140801
- Bulk Execution of Oblivious Algorithms on the Unified Memory Machine, with GPU Implementation, Proc. of International Parallel and Distributed Processing Symposium Workshops, 586-595, 20140519
- An Efficient Implementation of the Gradient-based Hough Transform using DSP slices and block RAMs on the FPGA, Proc. of International Parallel and Distributed Processing Symposium Workshops, 762-770, 20140519
- C2CU : A CUDA C Program Generator for Bulk Execution of a Sequential Algorithm, Proc. of International Conference on Algorithms and Architectures for Parallel Processing, 178-191, 201408
- GPU-accelerated Verification of the Collatz Conjecture, Proc. of International Conference on Algorithms and Architectures for Parallel Processing, 483-496, 201408
- A GPU Implementation of Clipping-Free Halftoning using the Direct Binary Search, Proc. of International Conference on Algorithms and Architectures for Parallel Processing, 57-70, 201408
- Random Address Permute Shift Technique for the Shared Memory on GPUs, Proc. of International Conference on Parallel Processing Workshops, 429-483, 201409
- Parallel Algorithms for the Summed Area Table on the Asynchronous Hierarchical Memory Machine, with GPU implementations, Proc. of International Conference on Parallel Processing, 251-250, 201409
- Thorough Evaluation of GPU Shared Memory Load and Store Instructions, in Proc. of International Symposium on Computing and Networking, 614-616, 201412
- An Efficient Implementation of the One-Dimensional Hough Transform Algorithm for Circle Detection on the FPGA, in Proc. of International Symposium on Computing and Networking, 447-452, 201412
- Optimality of Fundamental Parallel Algorithms on the Hierarchical Memory Machine, with GPU implementation, Proc. of International Conference on Parallel, Distributed and Network-Based Processing, 626-634, 201503
- A character art generator using the local exhaustive search, with GPU acceleration, INTERNATIONAL JOURNAL OF PARALLEL EMERGENT AND DISTRIBUTED SYSTEMS, 31(1), 47-63, 201601
- Bulk execution of Euclidean algorithms on the CUDA-enabled GPU, International Journal of Networking and Computing, 6(1), 42-63, 201601
- Bulk GCD Computation Using a GPU to Break Weak RSA Keys, Proc. of International Parallel and Distributed Processing Symposium Workshops, 385-394, 201505
- GPU-accelerated Digital Halftoning by the Local Exhaustive Search, Proc. of the 14th International Symposium on Parallel and Distributed Computing, 82-87, 201506
- Optimal Parallel Hardware K-Sorter and TopK-Sorter, with FPGA implementations, Proc. of the 14th International Symposium on Parallel and Distributed Computing, 138-147, 201506
- Parallel FDFM Approach for Computing GCDs Using the FPGA, Proc. of 11th International Conference of Parallel Processing and Applied Mathematics, 238-247, 201509
- A Parallel Algorithm for LZW decompression, with GPU implementation, Proc. of 11th International Conference of Parallel Processing and Applied Mathematics, 228-237, 201509
- Fast LZW compression using a GPU, Proc. of International Symposium on Computing and Networking, 303-308, 201512
- A Warp-synchronous Implementation for Multiple-length Multiplication on the GPU, Proc. of International Symposium on Computing and Networking, 96-102, 201512
- A Fast Approximate String Matching Algorithm on GPU, Proc. International Symposium on Computing and Networking, 188-192, 201512
- Parallelization Techniques for Error Diffusion with GPU Implementations, Proc. of International Symposium on Computing and Networking, 30-39, 201512
- A flexible-length-arithmetic processor based on FDFM approach in FPGAs, Proc. of International Symposium on Computing and Networking, 364-370, 201512
- Efficient GPU implementations for the Conway's Game of Life, Proc. of International Symposium on Computing and Networking, 11-20, 201512
- Accelerating digital halftoning using the local exhaustive search on the GPU, Concurrency and Computation: Practice and Experience, Web(Web), Web-Web, 20160212
- An FPGA Implementation for a Flexible-Length-Arithmetic Processor Employing the FDFM Processor Core Approach, IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, E99D(12), 2901-2910, 201612
- Fully Parallelized LZW Decompression for CUDA-Enabled GPUs, IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, E99D(12), 2986-2994, 201612
- A Memory-Access-Efficient Implementation for Computing the Approximate String Matching Algorithm on GPUs, IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, E99D(12), 2995-3003, 201612
- GPU-Accelerated Bulk Execution of Multiple-Length Multiplication with Warp-Synchronous Programming Technique, IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, E99D(12), 3004-3012, 201612
- Fast Simulation of Conway's Game of Life Using Bitwise Parallel Bulk Computation on a GPU, INTERNATIONAL JOURNAL OF FOUNDATIONS OF COMPUTER SCIENCE, 27(8), 981-1003, 201612
- GPU-accelerated Exhaustive Verification of the Collatz Conjecture, International Journal of Networking and Computing, 7(1), 69-85, 201701
- Efficient Implementation of FDFM Approach for Euclidean Algorithms on the FPGA, International Journal of Networking and Computing, 6(2), 420-435, 201607
- Light Loss-Less Data Compression, with GPU implementation, Proc. of the 16th International Conference on Algorithms and Architectures for Parallel Processing, 281-294, 201612
- An Efficient Implementation of LZW Compression in the FPGA, Proc. of the 16th International Conference on Algorithms and Architectures for Parallel Processing, 512-520, 201612
- Accelerating Ant Colony Optimization for the Vertex Coloring Problem on the GPU, Proc. of International Symposium on Computing and Networking, 469-475, 201612
- A Memory-Access-Efficient Implementation of the Approximate String Matching Algorithm on GPU, Proc. of International Symposium on Computing and Networking, 483-489, 201612
- A hardware sorter for almost sorted sequences, with FPGA implementations, Proc. of International Symposium on Computing and Networking, 565-571, 201612
- An Evaluation of the Parallella Architecture for the Convex Hull Computation, Proc. of International Symposium on Computing and Networking, 704-706, 201612
- GPU-Accelerated Bulk Computation of the Eigenvalue Problem for Many Small Real Non-symmetric Matrices, Proc. of International Symposium on Computing and Networking, 490-496, 201612
- Bitwise Parallel Bulk Computation on the GPU, with Application to the CKY Parsing for Context-Free Grammars, Proc. of International Parallel and Distributed Processing Symposium Workshops, 589-598, 201605
- An Efficient Implementation of LZW Decompression in the FPGA, Proc. of International Parallel and Distributed Processing Symposium Workshops, 599-607, 201605
- C2CU: a CUDA C program generator for bulk execution of a sequential algorithm, CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 29(17), e4022, 20170910
- Adaptive loss-less data compression method optimized for GPU decompression, CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 29(24), e4283, 20171225
- An Efficient GPU Implementation of CKY Parsing Using the Bitwise Parallel Bulk Computation Technique, IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, E100D(12), 2857-2865, 201712
- Almost optimal column-wise prefix-sum computation on the GPU, JOURNAL OF SUPERCOMPUTING, 74(4), 1510-1521, 201804
- An Efficient GPU Implementation of Bulk Computation of the Eigenvalue Problem for Many Small Real Non-symmetric Matrices, International Journal of Networking and Computing, 7(2), 227-247, 201707
- Single Kernel Soft Synchronization Technique for Task Arrays on CUDA-enabled GPUs, Proc. of International Symposium on Computing and Networking, 11-20, 201711
- A Square Pointillism Image Generation, and its GPU Acceleration, Proc. of International Symposium on Computing and Networking, 38-47, 201711
- A Hybrid Architecture for the Approximate String Matching on an FPGA, Proc. of International Symposium on Computing and Networking, 48-57, 201711
- A GPU Implementation of Bulk Execution of the Dynamic Programming for the Optimal Polygon Triangulation, Proc. of 12th International Conference of Parallel Processing and Applied Mathematics, 314-323, 201709
- Almost Optimal Column-wise Prefix-sum Computation on the GPU, Proc. of 12th International Conference of Parallel Processing and Applied Mathematics, 224-233, 201709
- Simple and Fast Parallel Algorithms for the Voronoi Map and the Euclidean Distance Map, with GPU implementations, Proc. of 46th International Conference on Parallel Processing, 362-371, 201708
- Photomosaic Generation by Rearranging Subimages, with GPU Acceleration, Proc. of International Parallel and Distributed Processing Symposium Workshops, 942-951, 201705
- Accelerating the Smith-Waterman Algorithm Using Bitwise Parallel Bulk Computation Technique on GPU, Proc. of International Parallel and Distributed Processing Symposium Workshops, 932-941, 201705
- Efficient Byte Stream Pattern Test using Bloom Filter with Rolling Hash Functions on the FPGA, Proc. of International Symposium on Computing and Networking, 66-75, 201811
- A Prefix-Sum-Based Rabin-Karp Implementation for Multiple Pattern Matching on GPGPU, Proc. of International Symposium on Computing and Networking, 139-145, 201811
- Tile Art Image Generation Using Conditional Generative Adversarial Networks, Proc. of International Symposium on Computing and Networking Workshops, 209-215, 201811
- An Optimal Parallel Algorithm for Computing the Summed Area Table on the GPU, Proc. of International Parallel and Distributed Processing Symposium Workshops, 763-772, 201811
- Bulk execution of the dynamic programming for the optimal polygon triangulation problem on the GPU, CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 31(19), e4947, 20191010
- Accelerating the Smith-Waterman Algorithm Using the Bitwise Parallel Bulk Computation Technique on the GPU, IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, E102D(12), 2400-2408, 201912
- Efficient convolution pooling on the GPU, JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 138, 222-229, 20200401
- Folded Bloom Filter for High Bandwidth Memory, with GPU implementations, Proc. of International Symposium on Computing and Networking, 18-27, 201911
- Throughput-Optimal Hardware Implementation of LZW Decompression on the FPGA, Proc. of International Symposium on Computing and Networking Workshops, 78-83, 201911
- Efficient GPU Implementations to Compute the Diameter of a Graph, Proc. of International Symposium on Computing and Networking, 102-111, 201911
- Structured Sparse Fully-Connected Layers in the CNNs and its GPU Acceleration, Proc. of International Symposium on Computing and Networking Workshops, 148-154, 201911
- A Watercolor Painting Image Generation using Stroke-based Rendering, Proc. of International Symposium on Computing and Networking Workshops, 465-469, 201911
- Efficient cuDNN-compatible Convolution-Pooling on the GPU, Proc. of 13th International Conference of Parallel Processing and Applied Mathematics, 46-58, 201909
- Efficient Triangular Matrix Vector Multiplication on the GPU, Proc. of 13th International Conference of Parallel Processing and Applied Mathematics, 493-504, 201909
- Stained Glass Image Generation using Voronoi Diagram and its GPU Acceleration, Proc. of 13th International Conference of Parallel Processing and Applied Mathematics, 396-407, 201909
- FIFO-Based Hardware Sorters for High Bandwidth Memory, Proc. of International Parallel and Distributed Processing Symposium Workshops, 663-672, 201905
- A Rabin-Karp Implementation for Handling Multiple Pattern-Matching on the GPU, IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, E103D(12), 2412-2420, 202012
- Efficient implementations of Bloom filter using block RAMs and DSP slices on the FPGA, CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 33(12), e5623, 20210625
- Tile art image generation using parallel greedy algorithm on the GPU and its approximation with machine learning, CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 33(12), e5623, 20210625
- Efficient GPU Implementation for Solving the Maximum Independent Set Problem, Proc. of International Symposium on Computing and Networking, 29-38, 202011
- Fully-Pipelined Architecture for Simulated Annealing-based QUBO Solver on the FPGA, Proc. of International Symposium on Computing and Networking, 39-48, 202011
- Art Font Image Generation with Conditional Generative Adversarial Networks, Proc. of International Symposium on Computing and Networking Workshops, 151-156, 202011
- Huffman Coding with Gap Arrays for GPU Acceleration, Proc. of International Conference on Parallel Processing, article no. 1, 202008
- Adaptive Bulk Search: Solving Quadratic Unconstrained Binary Optimization Problems on Multiple GPUs, Proc. of International Conference on Parallel Processing, article no. 62, 202008
- An Efficient Multicore CPU Implementation for Convolution-Pooling Computation in CNNs, Proc. of International Parallel and Distributed Processing Symposium Workshops, 548-556, 202005
- A Work-Time Optimal Parallel Exhaustive Search Algorithm for the QUBO and the Ising model, with GPU implementation, Proc. of International Parallel and Distributed Processing Symposium Workshops, 557-566, 202005
Invited Lecture, Oral Presentation, Poster Presentation
- Fully Parallelized Lossless LZW decompression for CUDA-enabled GPUs, Koji Nakano, Koji Nakano, Shunji Funasaka, Yasuaki Ito, GPU Technology Conference (GTC 2016), 2016/05, Without Invitation, English, San Jose, USA
- Bitwise Parallel Bulk Computation for CKY Parsing, with the GPU implementation, Toru Fujita, Toru Fujita, Koji Nakano, Yasuaki Ito, GPU Technology Conference (GTC Japan 2016), 2016/10, Without Invitation, English, Tokyo
- Efficient Loss-Less Data Compression, with GPU implementation, Shunji Funasaka, Shunji Funasaka, Koji Nakano and Yasuaki, GPU Technology Conference (GTC Japan 2016), 2016/10, Without Invitation, English, Tokyo
- Bulk Computation of Eigenvalues of Many Small Real Non-symmetric Matrices on the GPU, Takumi Honda, Hiroki Tokura, Takumi Honda, Yasuaki Ito, Koji Nakano, Mitsuya Nishino, Yushiro Hirota, Masami Saeki, GPU Technology Conference (GTC Japan 2016), 2016/10, Without Invitation, English, Tokyo
- A Very Fast Data Compression Method Optimized for GPU Decompression, Koji Nakano, Koji Nakano, Shunji Funasaka, Yasuaki Ito, The GPU Technology Conference (GTC 2017), 2017/05, Without Invitation, English, San Jose, USA
- A Photomosaic Method by Rearranging Divided Images with GPU Acceleration, Yasuaki Ito, Yi Yang, Yasuaki Ito, Koji Nakano, Jacir L. Bordim, The GPU Technology Conference (GTC 2017), 2017/05, Without Invitation, English
- GPU Applications with Single Kernel Synchronization Technique, Shunji Funasaka, Shunji Funasaka, Koji Nakano, Yasuaki Ito, GPU Technology Conference (GTC Japan 2017), 2017/12, Without Invitation, English, Tokyo
- GPU Implementation of Image Generation for Square Pointillism, Hiroki Tokura, Hiroki Tokura, Yasuaki Ito, Koji Nakano, GPU Technology Conference (GTC Japan 2017), 2017/12, Without Invitation, English, Tokyo
- Parallel High-Quality Art Generation From Any Image Using A GPU, Hiroki Tokura, Yuki Kuroda, Yasuaki Ito, and Koji Nakano, Hiroki Tokura, Yuki Kuroda, Yasuaki Ito, and Koji Nakano, The GPU Technology Conference, 2018/05, Without Invitation, English, San Jose, USA
Awards
- 2020/08/20, Best Paper Award, ICPP Organizing Committee, Huffman Coding with Gap Arrays for GPU Acceleration
- 2020/11/27, Best Paper Award, Organizing Committee, Fully-Pipelined Architecture for Simulated Annealing-based QUBO Solver on the FPGA
- 2020/11/27, Best Paper Award, GCA Organizing Committee, Art Font Image Generation with Conditional Generative Adversarial Networks
- 2019/11/28, The 4th International Workshop on GPU Computing and AI Best Paper Award, GCA Organizing Committee, Structured Sparse Fully-Connected Layers in the CNNs and its GPU Acceleration
- 2018/11/29, The 6th International Symposium on Computing and Networking Best Paper Award, CANDAR Organizing Committee
- 2017/11/20, The 5th International Symposium on Computing and Networking Outstanding Paper Award, CANDAR 2017 Organizing Committee
- 2016/11/23, Best Paper Award
The 8th International Workshop on Parallel and Distributed Algorithms and Applications (PDAA), PDAA Organizing Committee
- 2016/12/15, Best Student Paper Award
16th International Conference on Algorithms and Architectures for Parallel Processing (ICA3PP), ICA3PP General Chair
- 2015/12/10, Outstanding Paper Award, CANDAR Organizing Committee
- 2015/12/10, Best Paper Award, CANDAR Organizing Committee
External Funds
Acceptance Results of Competitive Funds
- KAKENHI(Grant-in-Aid for Scientific Research (C)), 2020, 2022
- 2019/04/01, 2020/03/31
- KAKENHI, 2016, 2018
- KAKENHI, 2013, 2015
- KAKENHI, 2012, 2015
- KAKENHI, Research on abstract modelsof FPGAs and evaluation of hardware algorithms, 2009, 2012
- KAKENHI, A study on establishment of a theory for accelerating computation based on partial-computation using FPGAs, 2008, 2011
- KAKENHI, 2005, 2007
- KAKENHI, 2005, 2008
Social Activities
History as Committee Members
- Finance & Publication Chair, 2022/01, 2022/12, International Symposium on Computing and Networking (CANDAR)
- Guest Editor, 2021/12, 2021/11, Concurrency and Computation Practice and Experience (CCPE) special issue on CANDAR 2021
- Guest Editor, 2021/12, 2022/11, International Journal of Networking and Computing (IJNC) special issue on CANDAR 2021
- Guest Associate Editor, 2021/10, 2022/11, Special Section on Forefront Computing of IEICE Transactions on Information and Systems
- Guest Editor, 2021/02, 2021/07, International Journal of Networking and Computing (IJNC) special issue on CANDAR 2020
- Guest Editor, 2021/01, 2021/01, Concurrency and Computation Practice and Experience (CCPE) special issue on CANDAR 2020
- Guest Associate Editor, 2019/12, 2021/12, Special Section on Parallel, Distributed, and Reconfigurable Computing, and Networking of IEICE Transactions on Information and Systems
- Guest Editor, 2019/12, 2020/12, Concurrence and Computation Practice and Experience (CCPE) special issue on CANDAR 2019
- Workshop Co-chair, 2019/04, 2022/11, International Workshop on GPU Computing and AI
- Guest Editor, 2019/01, 2019/10, Concurrence and Computation Practice and Experience (CCPE) special issue on CANDAR 2018
- Program Committee, 2019/01, 2019/05, The 9th International Workshop on Networking, Computing, Systems, and Software
- Program Committee, 2018/04, 2018/12, International Conference on Algorithms and Architectures for Parallel Processing
- Guest Editor, 2018/02, 2018/07, International Journal of Networking and Computing
- Program Committee, 2018/01, 2018/09, International Conference on Algorithms and Architectures for Parallel Processing
- Program Committee, 2017/11, 2018/05, Workshop on Advances in Parallel and Distributed Computational Models
- Guest Associate Editor, 2017/06, 2020/12, Special Section on Parallel and Distributed Computing and Networking of IEICE Transactions on Information and Systems
- Editor, 2017/05, 9999/99/99, International Journal of Networking and Computing
- Program Committee, 2017/03, 2018/12, International Workshop on Parallel and Distributed Algorithms and Applications
- Guest Editor, 2017/02, 2017/07, International Journal of Networking and Computing
- Workshop Co-chair, 2017/01, 2019/12, International Workshop on GPU Computing and Applications
- Program Committee, 2017/01, 2017/08, International Conference on Algorithms and Architectures for Parallel Processing
- Workshop Co-chair, 2016/04, 2017/12, International Workshop on GPU Computing and Applications
- Special Section on Parallel and Distributed Computing and Networking of IEICE Transactions on Information and Systems, Guest Associate Editor, 2015/12, 2017/12, IEICE
- Guest Editor, 2014/02, 2014/04, International Journal of Networking and Computing
- Registration and Finance Chair, 2013/03, 2021/12, International Symposium on Computing and Networking
- Program Committee, 2012/04, 2022/12, International Workshop on Parallel and Distributed Algorithms and Applications
- Secretariat, 2010/12, 9999/99, International Journal of Networking and Computing
- Program Committee, 2008/12, 2022/05, Workshop on Advances in Parallel and Distributed Computational Models
Organizing Academic Conferences, etc.
- The Sixth International Workshop on GPU Computing and Applications, Workshop co-chair, 2021/01, 2021/12
- The Fourth International Workshop on GPU Computing and Applications, Workshop Co-chair, 2019/01, 2019/12
- The Second International Workshop on GPU Computing and Applications, Workshop Co-chair, 2017/01, 2017/12
- The First International Workshop on GPU Computing and Applications, Workshop Co-chair, 2016/01, 2016/12
- The third International Workshop on GPU Computing and AI, Workshop co-chair, 2018/01, 2018/12
- The Fifth International Workshop on GPU Computing and Applications, Workshop Co-chair, 2020/01, 2020/12