Publications
- Hung-Ming Lai, Jenq-Kuen Lee, and Yuan-Shin Hwang,
Enhancing LLVM Optimizations for Linear Recurrence Programs on RVV,
The 2023 International Workshop on Embedded Multicore Systems (ICPP-EMS 2023), Salt Lake City, Utah, USA, 2023
- Shao-Chung Wang, Lin-Ya Yu, Li-An Her, Yuan-Shin Hwang, and Jenq-Kuen Lee,
Pointer-Based Divergence Analysis for OpenCL 2.0 Programs,
ACM Transactions on Parallel Computing, Volume 8, Issue 4, Article 20, pp 1-23, December 2021
- Yuan-Ming Chang, Wei-Cheng Liao, Shao-Chung Wang, Chun-Chieh Yang, and
Yuan-Shin Hwang,
A Framework for Scheduling Dependent Programs on GPU Architectures,
Journal of Systems Architecture, Volume 106, June 2020
- Heng Lin, Piyou Chen, Yuan-Shin Hwang, and Jenq-Kuen Lee,
Devise Rust Compiler Optimizations on RISC-V Architectures with SIMD Instructions,
The 2019 International Workshop on Embedded Multicore Systems (ICPP-EMS 2019), Kyoto, Japan, 2019
- Yuan-Shin Hwang, Hsih-Hsin Lin, Shen-Hung Pai, and Chia-Heng Tu,
GPUBlocks: GUI Programming Tool for CUDA and OpenCL,
Journal of Signal Processing Systems, Volume 91, Pages 235-245, 2019
- Chun-Chieh Yang, Shao-Chung Wang, Min-Yi Hsu, Yuan-Ming Chang, Yuan-Shin Hwang, and Jenq-Kuen Lee,
Support OpenCL 2.0 Compiler on LLVM for PTX Simulators,
Journal of Signal Processing Systems, Volume 91, Pages 261-271, 2019
- Wei-Cheng Liao, Yuan-Ming Chang, Shao-Chung Wang, Chun-Chieh Yang and Yuan-Shin Hwang,
Scheduling Methods to Optimize Dependent Programs for GPU Architecture,
The 2018 International Workshop on Embedded Multicore Systems (ICPP-EMS 2018), Eugene, Oregon, USA, 2018
- Shih-Huan Chien, Yuan-Ming Chang, Chun-Chieh Yang, Yuan-Shin Hwang and Jenq-Kuen Lee,
Graph Support and Scheduling for OpenCL on Heterogeneous Multi-core Systems,
The 2018 International Workshop on Embedded Multicore Systems (ICPP-EMS 2018), Eugene, Oregon, USA, 2018
- Shao-Chung Wang, Li-Chen Kan, Chao-Lin Lee, Yuan-Shin Hwang, and Jenq-Kuen Lee,
Architecture and Compiler Support for GPUs Using Energy-Efficient Affine Register Files,
ACM Transactions on Design Automation of Electronic Systems, Volume 23, No. 2, Article 18, November 2017
- Yuan-Ming Chang, Shao-Chung Wang, Chun-Chieh Yang, Yuan-Shin Hwang, and Jenq-Kuen Lee,
Enabling PoCL-based runtime frameworks on the HSA for OpenCL 2.0 support,
Journal of Systems Architecture, Volume 81, Pages 71-82, November 2017
- Chun-Chieh Yang, Shao-Chung Wang, Min-Yi Hsu, Yuan-Ming Chang, Yuan-Shin Hwang, and Jenq-Kuen Lee,
OpenCL 2.0 Compiler Adaptation on LLVM for PTX Simulators,
The 2017 International Workshop on Embedded Multicore Systems (ICPP-EMS 2017), Bristol, UK, 2017
- Yuan-Shin Hwang and Wei-Che Hsu,
Floating Accumulator Architecture,
Microprocessors and Microsystems, Volume 51, Pages 8-17, June 2017
- Li Wang, Ren-Wei Tsai, Shao-Chung Wang, Kun-Chih Chen, Po-Han Wang, Hsiang-Yun Cheng, Yi-Chung Lee, Sheng-Jie Shu, Chun-Chieh Yang, Min-Yih Hsu, Li-Chen Kan, Chao-Lin Lee, Tzu-Chieh Yu, Rih-Ding Peng, Chia-Lin Yang, Yuan-Shin Hwang, Jenq-Kuen Lee, Shiao-Li Tsao, and Ming Ouhyoung,
Analyzing OpenCL 2.0 Workloads Using a Heterogeneous CPU-GPU Simulator,
The 2017 IEEE International Symposium on Performance Analysis of Systems and Software (Poster, ISPASS 2017), Santa Rosa, California, April 2017
- Shao-Chung Wang, Li-Chen Kan, Yuan-Shin Hwang, and Jenq-Kuen Lee,
Energy Efficient Affine Register File for GPU Microarchitecture,
The 2016 International Workshop on Embedded Multicore Systems (ICPP-EMS 2016), Philadelphia, August 2016
- Hsih-Hsin Lin, Chia-Heng Tu, and Yuan-Shin Hwang,
CUDABlock: A GUI Programming Tool for CUDA,
Proceedings of The 2015 International Workshop on Embedded Multicore Systems (ICPP-EMS 2015), Beijing, September 2015
- Shao-Chung Wang, Ming-Yu Hung, Jenq-Kuen Lee, Yuan-Shin Hwang, and Roy Dz-Ching Ju,
Pointer-Based Divergence Analysis in the SSA Form,
Proceedings of the 17th Workshop on Compilers for Parallel Computing (CPC 2013), Lyon, July 3-5, 2013
- Jen Huang, Huang-Jia Cheng and Yuan-Shin Hwang,
Power Devil: Tool for Power Gating Strategy Selection,
Proceedings of The 10th Workshop on Optimizations for DSP and Embedded Systems (ODES-10), Shenzen, February 2013
- Ming-Yu Hung, Peng-Sheng Chen, Yuan-Shin Hwang, Roy Dz-Ching Ju, and Jenq Kuen Lee,
Support of Probabilistic Pointer Analysis in the SSA Form,
IEEE Transactions on Parallel and Distributed Systems, Volume 23. No. 12, Pages 2366-2379, December 2012
- Hsu-Hung Chiang, Huang-Jia Cheng, and Yuan-Shin Hwang,
Doubling the Number of Registers on ARM Processors,
Proceedings of the 16th Workshop on Interaction between Compilers and Computer Architectures (INTERACT-16), February 2012
- Yuan-Shin Hwang, Tzong-Yuan Lin, and Rong-Guey Chang,
DisIRer: Converting a Retargetable Compiler into a Multi-Platform Binary Translator,
ACM Transactions on Architecture and Code Optimization (TACO), Volume 7, No. 4, Article 18, December 2010
- Huang-Jia Cheng, Yuan-Shin Hwang, Rong-Guey Chang, and Cheng-Wei Chen,
Trading Conditional Execution for More Registers on ARM Processors,
Proceedings of the 2010 IEEE/IFIP International Conference on Embedded and Ubiquitous Computing (EUC-10), Pages 53-59, Hong Kong, December 2010
- Jia-Jhe Li and Yuan-Shin Hwang,
Set-Associative Load/Store Caches,
Proceedings of the 2010 International Conference on High Performance Computing Systems (HPCS-10), Pages 1-8,
Orlando, July 2010
- Yuan-Shin Hwang and Jia-Jhe Li,
On Reducing Load/Store Latencies of Cache Accesses,
Journal of Systems Architecture, Volume 56, Issue 1, Pages 1-15, January 2010
- Jia-Jhe Li and Yuan-Shin Hwang,
Indirect-Mapped Caches:
Approximating Set-Associativity with Direct-Mapped Caches,
Proceedings of The 2009 International Conference on Computer Design (CDES'09),
Pages 164-170, Las Vegas, July 2009
- Yen-Hsiang Fan, Yuan-Shin Hwang, Yi-Ping You, and Jenq-Kuen Lee,
Compiler-based vs. Hardware-based Power Gating Techniques for Functional Units,
Proceedings of The 6th Workshop on Optimizations for DSP and Embedded Systems (ODES-6), Boston, April 2008
- Yuan-Shin Hwang and Jia-Jhe Li,
Snug Set-Associative Caches: Reducing Leakage Power of Instruction and Data Caches with No Performance Penalties,
ACM Transactions on Architecture and Code Optimization (TACO), Volume 4, No. 1, Article 6, March 2007
- Chia-Heng Tu, Peng-Sheng Chen, and Yuan-Shin Hwang,
Hierarchical Shape Analysis,
Proceedings of the 13th Workshop on Compilers for Parallel Computing (CPC'07), Lisbon, Portugal, July 2007
- Ying Chen Lin, Sy-Yuan Li, and Yuan-Shin Hwang,
Dynamic Load-Balancing of Jini and .NET Services,
Proceedings of the ICPP06 Workshop on Compile/Runtime Techniques for Parallel Computing (CRTPC 2006), Pages 257-264, Columbus, Ohio, August 2006
- Jia-Jhe Li and Yuan-Shin Hwang,
On Reducing Load/Store Latencies of Cache Accesses,
Proceedings of the 4th Workshop on Memory Performance Issues (WMPI-2006), Pages 131-138, Austin, Texas, February 2006
- Jia-Jhe Li and Yuan-Shin Hwang,
Snug Set-Associative Caches: Reducing Leakage Power while Improving Performance,
Proceedings of the 2005 International Symposium on Low Power Electronics and Design (ISLPED 2005), Pages 345-350, San Diego, August 2005
- Hung-Hsiang Lin, Chia-Heng Tu, and Yuan-Shin Hwang,
Dynamic Load-Balancing of Jini Services with Smart Proxies,
Proceedings of The 2005 International Conference on Parallel and Distributed Processing Techniques and Applications (PDPTA'05), Volume II, Pages 721-726, Las Vegas, June 2005
- Chung-Chin Luo, Yuan-Shin Hwang, and Gene Eu Jan,
Minimal Steiner Trees in X Architecture with Obstacles,
Proceedings of The 2005 International Conference on Computer Design (CDES'05), Pages 198-203, Las Vegas, June 2005
- Peng-Sheng Chen, Yuan-Shin Hwang, Roy Dz-Ching Ju, and Jenq Kuen Lee,
Interprocedural Probabilistic Pointer Analysis,
IEEE Transactions on Parallel and Distributed Systems, Volume 15, No. 10,
pp. 893-907, October 2004
- Gene Eu Jan and Yuan-Shin Hwang,
Hierarchical Interconnection Networks Based on (3,3)-Graphs for Massively Parallel Processors,
IEICE Transactions on Information and Systems, special issue on
Hardware/Software Support for High Performance Scientific and Engineering Computing, VolumeE87-D, No. 7, pp. 1649-1656, July 2004
- Gene Eu Jan, Yuan-Shin Hwang, Ming-Bo Lin, and Deron Liang,
Novel Hierarchical Interconnection Networks for High-Performance Multicomputer Computers,
Journal of Information Science and Engineering, Volume 20, No. 6, Pages 1213-1229, 2004
- Peng-Sheng Chen, Ming-Yu Hung, Yuan-Shin Hwang, Roy Dz-Ching Ju, and
Jenq Kuen Lee,
Compiler Support for Speculative Multithreading Architecture
with Probabilistic Points-to Analysis,
ACM SIGPLAN Notices,
Proceedings of the 9th ACM SIGPLAN Symposium on Principles and Practice
of Parallel Programming, Volume 38, No. 10, pp. 25-36, October 2003
- Yuan-Shin Hwang and Joel Saltz,
Identifying Parallelism in Programs with Cyclic Graphs,
Journal of Parallel and Distributed Computing, Volume 63, No. 3, pp. 337-355, 2003
- Yuan-Shin Hwang, Peng-Sheng Chen, Jenq Kuen Lee, and Roy Dz-Ching Ju,
Probabilistic Points-to Analysis,
Lecture Notes in Computer Science, Volume 2624,
Languages and Compilers for Parallel Computing (LCPC 2001 Issue),
pp. 290-305, 2003
- Yuan-Shin Hwang and Joel Saltz,
Interprocedural Definition-Use Chains of Dynamic Pointer-Linked
Data Structures,
Scientific Programming, Volume 11, No.1, pp. 3-37, 2003
- Gene Eu Jan and Yuan-Shin Hwang,
An Efficient Algorithm for Perfect Load Balancing on Hypercube Multiprocessors,
Journal of Supercomputing, Volume 25, No. 1, pp. 5-15, 2003
- Yuan-Shin Hwang,
Parallelizing Graph Construction Operations in Program with Cyclic Graphs,
Parallel Computing, Volume 28, No. 9, Pages 1307-1328, September 2002
- Yung-Chia Lin, Yuan-Shin Hwang, and Jenq Kuen Lee,
Compiler Optimizations with DSP-Specific Semantic Descriptions,
To appear in Lecture Notes in Computer Science,
Languages and Compilers for Parallel Computing (LCPC 2002 Issue), 2002
- Raja Das, Yuan-Shin Hwang, Joel Saltz, and Alan Sussman,
Runtime and Compiler Support for Irregular Computations,
Lecture Notes in Computer Science, Volume 1808,
Runtime and Compiler Support for Irregular Computations,
Chapter 21, Pages 751-778,
Santosh Pande and Dharma Agrawal (Editors), 2001
- Yuan-Shin Hwang and Gene Eu Jan,
A Simple Algorithm for Optimal Load Balancing on Hypercube Multiprocessors,
Proceedings of the 2001 International Conference
on Parallel and Distributed Processing Techniques and Applications (PDPTA'2001),
Pages 2172-2178, Las Vegas, June 25-28, 2001
- Yuan-Shin Hwang,
Parallelizing Graph Construction Operations in Programs with Cyclic Graphs
,
Proceedings of the 2001 International Conference
on Parallel and Distributed Computing and Systems (PDCS 2001), Pages 100-105, Anaheim, August 21-24, 2001
- Yuan-Shin Hwang and Joel Saltz,
Identifying Parallelism in Programs with Cyclic Graphs,
Proceedings of the 2000 International Conference on Parallel
Processing, Pages 201-208, Toronto, Canada, August 21-24, 2000
(ICPP'2000 Best Paper Award)
- Yuan-Shin Hwang and Joel Saltz,
Identifying DEF/USE Information of Statements that Construct and
Traverse Dynamic Recursive Data Structures,
Lecture Notes in Computer Science, Volume 1366,
Languages and Compilers for Parallel Computing (LCPC'97 Issue), Pages 131-145,
Z. Li, P.-C. Yew, S. Chatterjee, C.-H. Huang, P. Sadayappan, and D. Sehr
(Editors), 1998
- Yuan-Shin Hwang and Joel Saltz,
Side Effect Analysis on User-Defined Reduction Functions with
Dynamic Pointer-Linked Data Structures,
Lecture Notes in Computer Science, Volume 1239,
Languages and Compilers for Parallel Computing (LCPC'96 Issue), Pages 594-596,
David Sehr, Utpal Banerjee, David Gelernter, Alex Nicolau, and David Padua
(Editors), 1997
- Joel Saltz, Gagan Agrawal, Chialin Chang, Raja Das, Guy Edjlali,
Paul Havlak, Yuan-Shin Hwang, Bongki Moon, Ravi Ponnusamy, Shamik Sharma,
Alan Sussman, and Mustafa Uysal,
Programming Irregular Applications: Runtime Support, Compilation,
and Tools
, Advances in Computers, Volume 45, Chapter 3, Pages 105-153,
Academic Press, 1997
- Yuan-Shin Hwang, Bongki Moon, Shamik D. Sharma, Ravi Ponnusamy, Raja Das,
and Joel H. Saltz,
Runtime and Language Support for Compiling Adaptive Irregular Programs,
Software Practice & Experience, Volume 25, No. 6, Pages 597-621, June 1995
- Yuan-Shin Hwang, Raja Das, Joel H. Saltz, Milan Hodoscek,
and Bernard Brooks,
Parallelizing Molecular Dynamics Programs for
Distributed Memory Machines,
IEEE Computational Science & Engineering, Volume 2, No. 2, Pages 18-29,
Summer 1995
- Ravi Ponnusamy, Joel H. Saltz, Alok Choudhary, Yuan-Shin Hwang,
and Geoffrey Fox,
Runtime Support and Compilation Methods for User-Specified Irregular
Data Distributions,
IEEE Transactions on Parallel and Distributed Systems, Volume 6, No. 8,
Pages 815-831, August 1995
- Ravi Ponnusamy, Yuan-Shin Hwang, Raja Das, Joel H. Saltz, Alok Choudhary,
and Geoffrey Fox,
Supporting Irregular Distributions Using Data-Parallel Languages,
IEEE Parallel and Distributed Technology, Volume 3, No. 1, Pages 12-24,
Spring 1995
- Yuan-Shin Hwang, Raja Das, and Joel H. Saltz,
A Data-Parallel Implementation of Molecular Dynamics Programs for
Distributed Memory Machines,
Proceedings of First IPPS Workshop on Solving Irregular Problems
on Distributed Memory Machines, Pages 28-34, Santa Barbara, California,
April, 1995
- Yuan-Shin Hwang, Bongki Moon, Shamik Sharma, Raja Das, and Joel H. Saltz,
Runtime Support to Parallelize Adaptive Irregular Programs,
Proceedings of the SIAM Workshop on Environments and Tools
for Parallel Scientific Computing, Pages 19-32, May, 1994
- Joel Saltz, Ravi Ponnusamy, Raja Das, Yuan-Shin Hwang, Bongki Moon,
and Shamik Sharma,
An Overview of Irregular Problem Runtime Support,
Proceedings of the First International Workshop
on Parallel Processing, Bangalore, India, December 1994
- Shamik D. Sharma, Ravi Ponnusamy, Bongki Moon, Yuan-Shin Hwang, Raja Das,
Run-time and Compile-time Support for Adaptive Irregular Problems,
Proceedings Supercomputing '94, Pages 97-106, November, 1994
- Raja Das, Mustafa Uysal, Joel H. Saltz, and Yuan-Shin Hwang,
Communication Optimizations for Irregular Scientific Computations on
Distributed Memory Architectures,
Journal of Parallel and Distributed Computing, Volume 22, No. 3, Pages 462-479,
September, 1994
- Raja Das, Yuan-Shin Hwang, Mustafa Uysal, Joel H. Saltz, and Alan Sussman,
Applying the CHAOS/PARTI Library to Irregular Problems in
Computational Chemistry and Computational Aerodynamics,
Proceedings of the 1993 Scalable Parallel Libraries Conference,
Pages 45-56, Mississippi State University, Starkville, Mississippi,
October, 1993