Publications
Publications by categories in reversed chronological order. Generated by jekyll-scholar.
2024
- Scalable, Programmable and Dense: The HammerBlade Open-Source RISC-V ManycoreIn 2024 ACM/IEEE 51st Annual International Symposium on Computer Architecture (ISCA), 2024
2020
- A 7.3 M Output Non-Zeros/J, 11.7 M Output Non-Zeros/GB Reconfigurable Sparse Matrix–Matrix Multiplication AcceleratorIEEE Journal of Solid-State Circuits, 2020
2019
- Evaluating Celerity: A 16-nm 695 Giga-RISC-V Instructions/s Manycore Processor With Synthesizable PLLIEEE Solid-State Circuits Letters, 2019
- A 1.4 GHz 695 Giga Risc-V Inst/s 496-Core Manycore Processor With Mesh On-Chip Network and an All-Digital Synthesized PLL in 16nm CMOSIn 2019 Symposium on VLSI Circuits, 2019
- A 7.3 M Output Non-Zeros/J Sparse Matrix-Matrix Multiplication Accelerator using Memory Reconfiguration in 40 nmIn 2019 Symposium on VLSI Technology, 2019
2018
- The Celerity open-source 511-core RISC-V tiered accelerator fabric: Fast architectures and design methodologies for fast chipsIEEE Micro, 2018
- Fast and efficient deep sparse multi-strength spiking neural networks with dynamic pruningIn 2018 International Joint Conference on Neural Networks (IJCNN), 2018
- FBNA: A Fully Binarized Neural Network AcceleratorIn 2018 28th International Conference on Field Programmable Logic and Applications (FPL), 2018
- The BaseJump Manycore Accelerator NetworkarXiv preprint arXiv:1808.00650, 2018
- Low Latency Spiking ConvNets with Restricted Output Training and False Spike InhibitionIn 2018 International Joint Conference on Neural Networks (IJCNN), 2018
- Extreme Datacenter Specialization for Planet-Scale Computing: ASIC CloudsACM SIGOPS Operating Systems Review, 2018
- Parallel Polar Encoding in 5G CommunicationIn 2018 IEEE Symposium on Computers and Communications (ISCC), 2018
- Parallel filtering method and corresponding apparatusMay 2018US Patent 9,966,932
- Progress in a novel architecture for high performance processingJapanese Journal of Applied Physics, May 2018
2017
- Celerity: An open source RISC-V tiered accelerator fabricIn Hot Chips: A Symposium on High Performance Chips, May 2017
- Experiences Using the RISC-V Ecosystem to Design an Accelerator-Centric SoC in TSMC 16nmMay 2017
- A self-indexed register file for efficient arithmetical computing hardwareIn 2017 9th Computer Science and Electronic Engineering (CEEC), May 2017
- A reconfigurable ASIC-like image polyphase interpolation implementation methodIn 2017 7th IEEE International Conference on Electronics Information and Emergency Communication (ICEIEC), May 2017
2016
- MaPU: A novel mathematical computing architectureIn 2016 IEEE International Symposium on High Performance Computer Architecture (HPCA), May 2016
- Methods and devices for multi-granularity parallel FFT butterfly computationFeb 2016US Patent 9,262,378
- Data access method and device for parallel FFT computationApr 2016US Patent 9,317,481
- Parallel bit reversal devices and methodsFeb 2016US Patent 9,268,744
2015
- Multi-granularity parallel storage systemOct 2015US Patent 9,171,593
- Multi-granularity parallel storage system and storageSep 2015US Patent 9,146,696