Pen-Chung Yew Professor Department of Computer Science and Engineering University of Minnesota at Twin Cities 4-192 Keller Hall 200 Union Street, SE Minneapolis, MN 55455, USA Contact: <yew> {at} cs
[dot] umn [dot] edu Education
Ph.D. 1981 University
of Illinois at Urbana-Champaign, Computer Science. M.S.
1977 University of Massachusetts at Amherst,
Computer Engineering B.S. 1972 National Taiwan University, Electrical
Engineering Publications (in
chronological order, updated 4/28/2015)
A.
Journal Papers ·
Z. Fang, S. Mehta, P.C. Yew, A. Zhai,
J. Greensky, G. Beeraka,
B. Zang, Measuring
Micro-architectural Details of Multi- and Many-core Memory Systems Through Micro-benchmarking, ACM Transactions on Architecture and Code Optimization (TACO), Vol.11,
Issue 4, January 2015. ·
F. Lv, L. Liu, M.H. Cui, L.
Wang, Y. Liu, X. Feng, P.C. Yew, WiseThrottling: A New Asynchronous Task Scheduler for
Mitigating I/O Bottleneck in Large-Scale Datacenter Servers, J. of
Supercomputing, 2015 ·
A. Holey, V. Mekkat, P.C. Yew,
A. Zhai, Performance-Energy
Considerations for Shared Cache Management in a Heterogeneous Multicore
Processor, ACM Transactions on Architecture and Code
Optimization (TACO), Vol 12, Issue 1, March 2015. ·
C. Wu, J. Li, D. Xu, P.C. Yew,
J. Li, and Z. Wang, FPS: A Fair-progress Process
Scheduling Policy on Shared-Memory Multiprocessors, IEEE Transactions on Parallel and Distributed
Systems (TPDS), Vol. 26, No. 2, February 2015, pp. 444-454 ·
Lv, H.M. Cui, L. Wang, L. Liu, C.G. Wu,
X.B. Feng and P.C. Yew, Dynamic I/O-Aware Scheduling for Batch-Mode Applications on Chip
Multiprocessor Systems of Cluster Platform, J. of Computer Science and
Technology (JCST), 29(1): 21-37, 2014 ·
D.Y. Hong, J.J.
Wu, P.C. Yew, W.C. Hsu, C.C. Hsu, P. Liu, C.M. Wang and Y.C. Chung, Efficient and Retargetable
Dynamic Binary Translation on Multicores, IEEE Transactions on Parallel
and Distributed Systems (TPDS), Vol. 25, No. 3, March 2014, pp. 622-632 ·
S. Mehta, G. Beeraka and P.C. Yew, Tile
Size Selection Revisited, ACM Transactions on
Architecture and Code Optimization (TACO), Vol. 10, No. 4, December 2013 · L. Gao, L. Li, J.L. Xue and P.C.
Yew, SEED: A Statically-Greedy and
Dynamically-Adaptive Approach for Speculative Loop Execution, IEEE Transaction
on Computers (TC), Vol. 62, No. 5, May 2013, pp. 1004-1016 · S.Y.
Wang, P.C. Yew and A. Zhai, Code Transformations for Enhancing the Performance of Speculatively
Parallel Threads, J. of Circuits, Systems and Computers (JCSC), No. 2,
Vol. 21, 2012 · Z. Wang,
C. Wu, P.C. Yew, J.J. Li and X. Di, On-the-Fly
Structure Splitting for Heap Objects, ACM Transactions
on Architecture and Code Optimization (TACO), January 2012 ·
H. Chen, R. Chen, F. Zhang, B. Zang and
P.C. Yew, Mercury: combining
performance with dependability using self-virtualization, Journal of
Computer Science and Technology (JCST), 2012. ·
H. Chen, J. Wu, C. Huang, P.C. Yew and B. Zang, Dynamic
Software Updating Using a Relaxed Consistency Model, IEEE Trans. on
Software Engineering (TSE), Vol. 37, No. 5, Sept/Oct 2011, pp. 679-694 ·
P. Woodward, J. Jayaraj,
P.H. Lin, P.C. Yew, Moving Scientific
Codes to IBM Cell Processor and Other Multicore Microprocessor CPUs, IEEE
Computing in Science and Engineering, Vol.10, No.6, pp.16-25, Nov/Dec. 2008 ·
S.V. Kodakara,
J. Kim, D.J. Lilja, D. Hawkins, W.C. Hsu, and P.C.
Yew, CIM: A Reliable Metric for
Evaluating Program Phase Classifications, IEEE Computer Architecture
Newsletter (CAN), 2007 ·
J. Lin, W.C. Hsu, P.C. Yew, R.D.C. Ju, and T.F. Ngai, Recovery Code Generation for General
Speculative Optimizations, ACM Transactions on Architecture and Code
Optimization (TACO), Vol.3, No.1, March 2006, pp. 67-89 ·
J.S. Kong, P.C. Yew and G.H. Lee, Minimizing the Directory Size for
Large-Scale Shared-Memory Multiprocessors, IEICE Trans. on Information
and Systems, Vol. E88-D No.11 2533, November 2005, pp. 2533-2543. ·
J. Lin, T. Chen, W.C. Hsu, P.C. Yew,
R.D.C. Ju, T.F. Ngai and
S. Chan, A Compiler Framework for
Speculative Optimizations, ACM Transactions on Architecture and Code
Optimization (TACO), Vol.1, No.3, September 2004, pp. 247-271 ·
J. Lu, H. Chen, P.C. Yew, W.C. Hsu, Design and Implementation of a Lightweight
Dynamic Optimization System, Journal of Instruction-Level Parallelism,
Volume 6, 2004 ·
P.Y. Tang and P.C. Yew, Interprocedural Induction Variable Analysis,
International Journal of Foundation of Computer Science, World Scientific,
Vol.14, No.3, June 2003, pp.405-423 ·
S.J. Lee and P.C. Yew, On Augmenting Trace Cache for
High-Bandwidth Value Prediction, IEEE Tran. on
Computers (TC), Vol.51, No. 9, September 2002, pp. 1074-1088. ·
S.J. Lee and P.C. Yew, On Table Bandwidth and Its Update Delay
for Value Prediction on Wide-Issue ILP Processors, IEEE Transactions on
Computers (TC), Vol. 50,
No.8, August 2001, pp.847-852. ·
H.B. Lim and P.C. Yew, Efficient Integration of Compiler-Directed
Cache Coherence and Data Prefetching, Journal of Parallel and Distributed
Computing (JPDC), Vol. 61, No. 12, Dec 2001, pp. 1775-1802 ·
S.Y. Cho, P.C. Yew and G. Lee, A High-Bandwidth Memeory
Pipeline for Wide-Issue Processors, IEEE Trans. on Computers (TC), Vol.
50, No.7 , July 2001, pp. 709-723. ·
L. Choi and P.C. Yew, Compiler Analysis for Cache Coherence: Interprocedural
Array Data-Flow Analysis and Its Impact on Cache Performance, IEEE Trans.
on Parallel and Distributed Systems (TPDS), Vol. 11, No. 9, Sept 2000, pp.
879-896. ·
L. Choi and P.C. Yew, Hardware and Compiler-Directed Cache Coherence in Large-Scale
Multiprocessors, the IEEE Trans. on Parallel and Distributed Systems
(TPDS), Vol. 11, No. 4, April 2000, pp. 375-394. ·
I.H. Kazi, et
al., JaViz: A
Client/Server Java Profiling Tool, a special issue on Java technology in
IBM Systems Journal, Vol. 39, No.1.1, 2000. ·
J.Y. Tsai, et al., The Superthreaded Architecture, a special
issue on multithreaded architectures in the IEEE Trans. on Computers (TC), Vol 48, No. 9, September 1999, pp. 881-903. ·
D.K. Chen and P.C. Yew, Redundant Synchronization Elimination for Doacross Loops, IEEE Trans. on Parallel and
Distributed Systems (TPDS), Vol.10, No. 5, May 1999. ·
H.B. Lim and P.C. Yew, Maintaining Cache Coherence Through
Compiler-Directed Data Prefetching, Journal of Parallel and Distributed
Computing (JPDC), Vol 53, No. 2, pp. 144-173, Sep
1998. ·
J.Y. Tsai, Z. Jiang, and P.C. Yew, Compiler Techniques for the Superthreaded Architectures, a Special Issue on
Languages and Compilers for Parallel Computing, International Journal of
Parallel Programming, June 1998. ·
J.Y. Tsai, P.C. Yew, et al, Integrating Parallelizing Compilation
Technology and Processor Architecture for Cost-Effective Concurrent
Multithreading, a special issue in Journal of Information Science and Eng, No. 14, pp.205-222, March 1998 ·
S. Adve, et al, The Interaction of Architecture and
Compilation Technology for High-Performance Processor Design, IEEE
Computers, December 1997 ·
W.T. Hsu and P.C. Yew, Performance Evaluation of Wire-Limited
Hierarchical Networks, Journal of Parallel and Distributed Computer
(JPDC), Vol. 41, June 1997, pp 156-172. ·
J.Y. Tsai and P.C. Yew, Enhancing Multiple-Path Speculative
Execution with Predicate Window Shifting, a special issue on
Microprocessor Architecture in Journal of System Architecture, June 1997 ·
L. Choi, H.B. Lim and P.c. Yew, Multiprocessor Cache Coherence: The
Compiler-Directed Approach, IEEE Parallel & Distributed Technology,
Winter 1996, pp.23-35 ·
D.K. Poulsen and
P.C. Yew, Integrating Fine-Grained
Message Passing in Cache Coherent Shared-Memory Multiprocessors, Journal
of Parallel and Distributed Computing (JPDC), Vol. 33, No. 2, March 1996, pp.
172-188. ·
D.K. Chen and P.C. Yew, On Effective Execution of Non-Uniform Doacross Loops, IEEE Trans. on Parallel and
Distributed Systems (TPDS), Vol. 7, No. 5, May 1996, pp. 463-476. ·
J.D. Bruner, C.J. Beckmann, P. Konas, D.K. Poulsen and P.C.
Yew, Chief: A Simulation Environment
for Studying Parallel Systems, International Journal of Computer
Simulation, Vol.6, No.1, 1996, pp. 89-100. ·
D.J. Lilja and
P.C. Yew, Improving Memory Utilization
in Cache Coherence Directories, IEEE Trans. on Parallel and Distributed
Systems (TPDS), Vol. 4, No.10, Oct. 1993, pp. 1130-1146. ·
W.T. Hsu and P.C. Yew, An Effective Synchronization Network for
Hot Spot Accesses, ACM Trans. on Computing Systems (TOCS), Vol. 10, No.3,
Aug. 1992, pp. 167-189. ·
Z. Shen, Z. Li
and P.C. Yew, An Empirical Study on
Program Characteristics for Parallelizing Compilers, IEEE Trans. on
Parallel and Distributed Systems (TPDS), Vol. 1, No. 3, July 1990, pp.
356-364. ·
Tim Davis and P.C. Yew, A Stable Non-Deterministic Parallel
Algorithm for General Unsymmetric Sparse LU
Factorization, SIAM J. on Matrix Analysis and Applications, Vol. 2, No.
3, July 1990, pp. 383-403. ·
Z. Li, P.C. Yew and C.Q. Zhu, An Efficient Data Dependence Analysis for
Parallelizing Compiler, IEEE Trans. on Parallel and Distributed Systems
(TPDS), Vol. 1, No. 1, Jan. 1990, pp. 26-34. ·
Z. Fang, P. Tang, P.C. Yew and C.Q. Zhu, Dynamic Processor Self-Scheduling for
General Parallel Nested Loops, IEEE Trans. on Computers (TC), Vol. 39,
No. 7, July 1990, pp. 919-929. ·
P. Tang, P.C. Yew, Software Combining Algorithms for Distributing Hot-Spot Addressing,
J. of Parallel and Distributed Computing (JPDC), Vol. 10, No.2, Oct. 1990,
pp. 130-139. ·
N.F. Tzeng and
P.C. Yew and C.Q. Zhu, Realizing
Fault-Tolerant Interconnection Networks via Chaining, IEEE Trans. on
Computers (TC), Vol. 37, No. 4, pp. 458-462, April 1988. ·
Z. Li and P.C. Yew, Program Parallelization with Interprocedural
Analysis, J. of Supercomputing, Kluwer Academic Publishers, 1988, pp.
225-244. ·
C.Q. Zhu and P.C. Yew, A Scheme to Enforce Data Dependence on
Large Multiprocessor Systems, IEEE Trans. on Software Engineering (TSE),
Vol. SE-13, No. 6, pp. 726-739,
June 1987. ·
P.C. Yew, N.F. Tzeng
and D.H. Lawrie,, Distributing Hot Spot Addressing in Large Scale Multiprocessors,
IEEE Trans. on Computers (TC), Vol. C-36, No. 4, pp. 388-395, April 1987. ·
P.C. Yew, D.A. Padua and D.H. Lawrie,, Stochastic Properties of a Multiple-Layer Single-Stage
Shuffle-Exchange Network in a Message Switching Environment, J. of
Digital Systems, Vol. 6, No. 4, pp. 387-410, 1982. ·
P.Y. Chen, D.H. Lawrie,
P.C. Yew and D.A. Padua, Interconnection
Networks Using Shuffle, IEEE Computer, Vol. 14, No. 12, pp. 55-64,
December 1981. ·
P.C. Yew and D.H. Lawrie,
An Easily Controlled Network for
Frequently Used Permutations, IEEE Trans. on Computers (TC), Vol. C-30,
No.4, pp. 296-298, April 1981. B. Refereed Conference Papers · S. Mehta and P.C. Yew, Improving
Compiler Scalability: Optimizing Programs at Small Price, Proc. of ACM
SIGPLAN Int’l Conf. on Programming Languages Design and Implementation
(PLDI), June 2015 · X. Yuan, C. Wu, Z. Wang, J. Li, X. Feng,
P.C. Yew, Y. Lan, Y. Chen, J. Huang, Y. Guan, Reproducing Concurrency Bugs using Local
Clocks, Proc. of Int'l Conf. on Software Engineering
(ICSE), May, 2015 · W. Wang, C. Wu, P.C. Yew, X. Shen,
X. Yuan, Z. Wang, J. Li, X. Feng, Localization of Concurrency Bugs Using
Shared Memory Access Pairs, 29th IEEE/ACM International
Conference on Automated Software Engineering (ASE), September 2014 · S. Mehta,
Z. Fang, A. Zhai and P.C. Yew, Multistage Coordinated Prefetching for Present-Day Processors, Proc. of the 2014 Int't Conf. on Supercomputing (ICS), June 2014 · C.F. Chen,
C.C. Chen, et al, DAPs: Dynamic Adjustment and Partial Sampling for
Multithreaded/Multicore Simulation,
Proc. of 51th International Design Automation Conference (DAC), June 2014 · Y.H. Lu,
D.Y. Hong, T.Y. Wu,
J.J. Wu, P. Liu, W.C. Hsu, and P.C. Yew, DBILL: An Efficient and Retargetable
Dynamic Binary Instrumentation Framework using LLVM Backend, Proc. of 10th Int'l Conf. on
Virtual Execution Environments (VEE), March 2014 · C.R.
Chang, J.J. Wu, P. Liu, W.C. Hsu, and P.C. Yew, Efficient Memory Virtualization for Cross-ISA
System Mode Emulation,
Proc. of 10th
Int'l Conf. on Virtual Execution Environments (VEE), March 2014 · S. Mehta,
P.H. Lin, and P.C. Yew, Revisiting Loop Fusion in the Polyhedral Framework, Proc. of ACM SIGPLAN 19th Annual Symp. on Principles and Practice
of Parallel Programming (Ppopp), February 2014 · S.H. Chen,
S.M. Lin, K.Y. Chen, Y.H. Chang, P.C. Yew, C.C. Ho, A Systematic Methodology for OS Benchmarks Characterization,
Proc. of ACM Int’l Conf. on Reliable and Convergent Systems (RACS), October
2013 · V. Mekkat, A. Holey, P.C. Yew and A. Zhai,
Managing Last-Level Cache in a
Heterogeneous Multicore Processor, Proc. of Int'l Conf. on
Parallel Architectures and Compiler Techniques (PACT), September 2013. · X. Yuan,
C. Wu, P.C. Yew, W. Wang, Z. Wang, J. Li and D. Xu,
Synchronization Identification through
On-the-Fly Test, Proc. of 2013 Euro-Par Conference (Euro-Par), August
2013 · C.C. Hsu,
J.J. Wu, P.C. Yew, D.Y. Hong, C.M. Wang, and W.C. Hsu, Improving Dynamic Binary Optimization Through Early-Exit Guided Code
Region Formation, 9th Int'l Conf on Virtual Execution
Environments (VEE), March 2013 · P.H. Lin,
J. Jayaraj, P. Woodward, and P.C. Yew, A Study of Performance Portability Using
Piecewise-Parabolic Method (PPM) Gas Dynamics Applications, Proc. of Int’l Conf. on Computational Science (ICCS), May 2012 · D. Xu, C. Wu, P.C. Yew, J.
Li, and Z. Wang, Providing Fairness on Shared
Memory Multiprocessors via Process Scheduling, ACM SIGMETRICS Performance, June 2012 · D.Y. Hong, C.C. Hsu, P.C. Yew, J.J. Wu, W.C. Hsu,
Y.C. Chung, P. Liu and C.M. Wang, HQEMU:
A Multi-Threaded and Retargetable Dynamic Binary
Translator on Multicores, Proc. of the 10th Annual IEEE/ACM
Int'l Symp. on Code
Generation and Optimization (CGO), March, 2012 · C.C. Hsu, P. Liu, C.M. Wang, J.J. Wu,
D.Y. Hong, P.C. Yew and W.C. Hsu, LnQ: Building High
Performance Dynamic Binary Translators with Existing Compiler Backends,
Proc. of the 40th International Conference on Parallel
Processing (ICPP), Taipei, Taiwan,
September 2011 · D. Xu, C.
Wu and P.C. Yew, On Mitigating Memory Bandwidth
Contention Through Bandwidth-Aware Scheduling, Proc. of Int'l Conf. on Parallel Architectures and Compiler
Techniques (PACT), September 2010. · P. Woodward, et al, Boosting the Performance of Computational Fluid Dynamics Codes for Interactive Supercomputing, Proc. Of Int’l Conf. on Computational Science (ICCS), May 2010 · J. Lin and P.C. Yew, A Compiler Framework for General Memory
Layout Optimization Targeting Structures and Arrays, The 12th Annual Workshop on the
Interaction between Compilers and Computer Architecture (INTERACT),
March 2010. · Z. Wang, C. Wu and P.C. Yew, On Improving Heap Memory Layout by Dynamic
Pool Allocation, Proc. of the 8th
Annual IEEE/ACM Int'l Symp. on
Code Generation and Optimization (CGO), April, 2010. · L. Wang, et.
al., An
Adaptive Task Creation Strategy for Work-Stealing Scheduling, Proc. of the 8th Annual
IEEE/ACM Int'l Symp. on
Code Generation and Optimization (CGO), April, 2010. · H. Chen, L. Yuan, X. Wu, B. Zang, B. Huang, P.C. Yew, Control Flow Obfuscation with Information Flow Tracking, Proc. of the 42nd Int'l Symp. on Microarchitecture
(MICRO-42), November 2009 · V. Packirisamy, A. Zhai, W.C. Hsu, T.F. Ngai, P.C. Yew, Exploring Speculative Parallelism in SPEC2006, Proc. of IEEE Int’l Symp. On Performance Analysis of Systems and Software (ISPASS), April 2009 · Y. Duan,
X. Feng, P.C. Yew, Detecting and Eliminating Violation of Sequential Consistency for
Concurrent C/C++ Programs, Proc. of IEEE/ACM Int'l Symp.
on Code Generation and Optimization (CGO), March
2009 · (Best Paper
Award) V. Packirisamy, Y. Luo, W.L. Hung, A. Zhai and
P.C. Yew, Efficiency of Thread-Level
Speculation in SMT and CMP Architectures – Performance, Power and
Thermal Perspective, Proc. Of Int’l Conf. on Computer Design (ICCD), Oct.
2008. · H. Chen, X. Wu, L. Yuan, B. Zang,
P.C. Yew, F.T. Chong, From Speculation
Security: Practical and Efficient Information Flow Tracking Using Speculative
Hardware, Proc.
of 35th Int'l Symp. on
Computer Architecture (ISCA-35), June 2008 · G.J. He, A. Zhai and P.C.
Yew, Ex-Mon: An Architectural Framework
for Dynamic Program Monitoring on Multicore Processors, The 12th Annual
Workshop on the Interaction between Compilers and Computer Architecture
(INTERACT), Feb 2008. · (Best
Paper Award) H. Chen, R. Chen, F. Zhang, B. Zang and P.C. Yew, Mercury:
combining performance with dependability using self-virtualization, Proc.
of Int’l Conf. on Parallel Processing (ICPP), Sept. 2007 · S.V. Kodakara, J. Kim,
W.C. Hsu, D.J. Lilja, P.C. Yew, Analysis of Statistical Sampling in
Microarchitecture Simulation: Metric, Methodology and Program
Characterization, Proc. of Int'l Symp. on Workload Characterization (IIWCS), Sept, 2007 · J. Kim, W.C. Hsu and P.C.
Yew, COBRA: An Adaptive Runtime Binary Optimization Framework
for Multithreaded Applications,
Proc. of Int’l Conf. on Parallel Processing (ICPP), Sept. 2007 · S.J. Lee, H.K. Lee, and P.C. Yew, Runtime Performance Projection Model for
Dynamic Power Management, Proc. of Asia-Pacific Computer Systems
Architecture Conference (ACSAC), Aug. 2007 · J. Kim, S.V. Kodakara,
W.C. Hsu, D.J. Lilja, R. Geva,
P.C. Yew, Entropy-Based Profile
Characterization and Classification for Automatic Profile Management,
Proc. of Asia-Pacific Computer Systems Architecture Conference (ACSAC), Aug.
2007 · H. Chen, J. Yu, C. Rong, B.Y. Zang and P.C. Yew, POLUS: A Powerful Live Updating Systems,
Proc. of Int'l Conf. on Software Engineering (ICSE), May, 2007 · R. Fu, A. Zhai, P.C. Yew
and W.C. Hsu, J. Lu, Reducing Queueing Stalls Caused by Data Prefetching, The 11th
Annual Workshop on the Interaction between Compilers and Computer
Architecture (INTERACT) May 2007. · V. Packirisamy,
S.Y. Wang, A. Zhai, W.C. Hsu, P.C. Yew, Supporting Speculative Multithreading on
Simultaneous Multithreaded Processors, in Proc. of Int'l Conf. on High
Performance Computing (HiPC), Bangalore, India, Dec
2006 · S.Y. Wang, A. Zhai, P.C. Yew, Exploiting
Speculative Thread-Level Parallelism in Data Compression Applications, in
Proc. of 19th Workshop on Languages and Compiler for Parallel Computing (LCPC), New Orlean, LA Nov. 2006 · H.B. Chen, R. Chen, F.Z.
Zhang, B.Y. Zang, P.C. Yew, Live Updating Operating Systems
Using Virtualization, 2nd Int'l Conf on Virtual
Execution Environments (VEE), June 2006 · J. Kim, S.V. Kodakara, W.C. Hsu, D.J. Lilja,
P.C Yew, Dynamic Code Region (DCR)
Based Program Phase Tracking and Prediction for Dynamic Optimizations,
Lecture Notes in Computer Science, Volume 3793 (HiPEAC),
Oct 2005, pp. 203 - 217. · X. Dai, A. Zhai, W.C. Hsu and P.C. Yew, A General Compiler Framework for Speculative Optimizations Using Data
Speculative Code Motion, Proc. of the Third Annual IEEE/ACM Int'l Symp. on Code Generation and
Optimization (CGO), March 2005, pp. 280-290 · A. Das, J. Lu, H. Chen,
J. Kim, P.C. Yew, W.C. Hsu, D.Y. Chen, Performance
of Runtime Optimization on BLAST, Proc. of the Third Annual IEEE/ACM
Int'l Symp. on Code
Generation and Optimization (CGO), March 2005, pp. 86-96 · J. Lin, W.C. Hsu, P.C.
Yew, R.D. Ju and T.F. Ngai,
A Compiler Framework for Recovery Code
Generation in General Speculative Optimizations, Proc. of Int'l Conf. on
Parallel Architectures and Compiler Techniques (PACT), September 2004, pp.
17-28 · T. Chen, J. Lin, X. Dai,
W.C. Hsu and P.C. Yew, Data Dependence
Profiling for Speculative Optimizations, Proc. of 14 Int'l Conf. on
Compiler Construction (CC),
March 2004, pp. 57-62 · H. Chen, J. Lu, W.C. Hsu,
P.C. Yew, Continuous Adaptive
Object-Code Re-optimization Framework, Ninth Asia-Pacific Computer
Systems Architecture Conference (ACSAC), pp. 241-255, Sept 2004. · J. Lu, H. Chen, R. Fu,
W.C. Hsu, B. Othmer and P.C. Yew, The Performance of Runtime Data Cache
Prefetching in a Dynamic Optimization System, Proc. of 36th Annual Int'l Symp. on Microarchitecture
(MICRO-36), December 2003 · J. Lin, T. Chen, W.C.
Hsu, P.C. Yew, R.D. Ju and T.F. Ngai,
A Compiler Framework for Speculative
Analysis and Optimizations, Proc. of ACM/SIGPLAN Conf. on Programming
Language Design and Implementation (PLDI), June 2003, pp.289-299 · H. Chen, W.C. Hsu, J. Lu,
B. Othmer, D.Y. Chen, and P.C. Yew, Dynamic Trace Selection Using Performance
Monitoring Hardware Sampling, Proc. of the 1st IEEE/ACM Int'l Symp. on Code Generation and
Optimization (CGO), March 2003, pp. 79-90 · J. Lin, T. Chen, W.C. Hsu
and P.C. Yew, Speculative Register
Promotion Using Advanced Load Address Table (ALAT), Proc. of the 1st
IEEE/ACM Int'l Symp. on
Code Generation and Optimization (CGO), March 2003, pp. 125-134 · T. Chen, J. Lin, W.C. Hsu
and P.C. Yew, An Empirical Study on the
Granularity of Pointer Analysis in C Programs, Proc
of the 15th Workshop on Languages and Compilers for Parallel Computing
(LCPC), Aug. 2002 · W.C. Hsu, H. Chen, P.C.
Yew and D.Y. Chen, On the
Predictability of Program Behavior Using Different Input Data Sets, Proc.
of the 6th Workshop on Interaction Between Compilers and Computer
Architectures (INTERACT-6), Feb 2002. · P.Y. Tang and P.C. Yew, Interprocedural Induction Variable Analysis, Proc. of
6th Int'l Symp. on
Parallel Architectures, Algorithms and Networks (I-SPAN), pp. 245-250, May
2002. · T. Chen, J. Lin, W.C. Hsu
and P.C. Yew, An Empirical Study on the
Characteristics of Heap-Oriented Pointers in C Programs, Proc. of 6th
Int'l Symp. on Parallel
Architectures, Algorithms and Networks (I-SPAN), pp.251-256, May 2002 · S.J. Lee and P.C. Yew, On Some Implementation Issues for Value
Prediction on Wide-Issue ILP Processors, Proc. of Int'l Conf. on Parallel
architectures and Compiler Techniques (PACT), Oct. 2000, pp.145-156 · S.J. Lee, Y. Wang and
P.C. Yew, Decoupled Value Prediction on
Trace Processors, Proc. of Int'l Conf on
High-Performance Computer Architecture (HPCA-6), Jan 2000, pp.231-240 · (Best Paper Award) H.B. Lim and P.C. Yew, Efficient Integration of Compiler-Directed
Cache Coherence and Data Prefetching, Proc. of the 2000 Int'l Parallel
and Distributed Processing Symposium (IPDPS) (Best Paper Award), May 2000,
pp. 331-342 · S.Y. Cho, P.C. Yew and
G.H. Lee, Access Region Locality for
High-Bandwidth Processor Memory System Design, Proc. of the 32nd Int'l Symp. on Microarchitecture
(MICRO-32), Nov. 1999, pp.136-146 · S.Y. Cho and P.C. Yew, Decoupling Local Variable Accesses in a
Wide-Issue Superscalar Processor, Proc. of the 26th Intn'l
Symp. on Computer
Architecture (ISCA-26), May 1999, pp.100-110 · B. Zheng,
et. al., Designing the Agassiz Compiler for
Concurrent Multithreaded Architectures, Proc. of the 12th Workshop on
Languages and Compilers for Parallel Computing (LCPC-12), Aug. 1999 · J.Y. Tsai, Z. Jiang, E.
Ness, and P.C. Yew, Performance of a
Concurrent Multithreaded Processors, Proc. of the 4th International
Symposium of High Performance Computer Architectures (HPCA-4), Feb. 1998, pp.
24-34 · S. Cho, J.Y. Tsai, et.al,
High-Level Information - An Approach
for Integrating Front-end and Back-end Compilers, Proc
of the 1998 Int'l Conf on Parallel Processing
(ICPP), Aug. 1998, pp. 346-355 · H.B. Lim and P.C. Yew, An Integrated Framework for
Compiler-Directed Cache Coherence and Data Prefetching, Proc. of the 11th
Workshop on Languages and Compilers for Parallel Computing (LCPC-11), Aug.
1998 · H.B. Lim, and P.C. Yew, A Compiler-Directed Cache Coherence Scheme
Using Data Prefetching, Proc. of the Int'l Symp.
on Parallel Processing (IPPS), Apirl
1997, pp. 643-649 · J.Y. Tsai, B. Zheng, and P.C. Yew,
Program Optimization for Concurrent Multithreaded Architectures, Proc. of
the 10th Workshop on Languages and Compilers for Parallel Computing
(LCPC-10), Aug. 1997 · L. Choi and P.C. Yew, Compiler and Hardware Support for Cache
Coherence in Large-Scale Multiprocessors: Design Considerations and
Performance Evaluation, Proc. of the 23rd Int'l Symp
on Computer Architecture (ISCA-23), May 1996, pp. 283-294 · L. Choi and P.C. Yew, Program Analysis for Cache Coherence:
Beyond Procedural Boundaries, Proc. of the 1996 Int'l Conf. on Parallel
Processing (ICPP), Aug. 1996, Vol. 3, pp. 103-114, · L. Choi and P.C. Yew, Eliminating Stale Data References through
Array Data-Flow Analysis, Proc. of the 1996 Int'l Symp.
on Parallel Processing (IPPS), April, 1996, pp. 4-13
· J.Y. Tsai and P.C. Yew, The Superthreaded
Architecture: Thread Pipelining for Run-Time Data Dependence Checking and
Control Speculation, Proc. of the 1996 Int'l Conf. on Parallel
architectures and Compiler Techniques (PACT), Oct. 1996, pp.35-46 · W.T. Hsu and P.C. Yew, Let Us Build System-Friendly Networks
– Build Them Hierarchically, invited paper for 1996 ICPP Workshop
on Challenges for Parallel Processing, Aug 1996 · Z. Li, J.Y. Tsai, X.
Wang, P.C. Yew and B. Zheng, Compiler Techniques for Concurrent Multithreading with Hardware
Speculation Support, Proc. of the 9th Workshop on Languages and Compilers
for Parallel Computing (LCPC-9), Aug. 1996 · H.B. Lim, L. Choi and
P.C. Yew, On Using Data Prefetching for
Cache Coherence in Multiprocessors, Proc. of the 9th Workshop on
Languages and Compilers for Parallel Computing (LCPC-9), Aug. 1996 · P. Konas
and P.C. Yew, Processor Self-Scheduling
in Parallel Discrete Event Simulation, Proc. of the 1995 Winter
Simulation Conference, December 1995. · L. Choi and P.C. Yew, Interprocedural Array Data-Flow Analysis for Cache
Coherence, Eighth Workshop on Languages and Compilers for Parallel
Computing (LCPC-8), August 1995. · P. Konas
and P.C. Yew, Partitioning for
Synchronous Parallel Simulation, Proc. of the ACM/IEEE/SCS 9th Workshop
on Parallel and Distributed Simulation, 1995 · D.K. Chen, J. Torrellas and P.C. Yew, An Efficient Algorithm for the Run-Time Parallelization of Doacross Loops, Proc. of Supercomputing '94, pp.
518-527 · L. Choi and P.C. Yew, A Compiler-Directed Cache Coherence Scheme
with Improved Intertask Locality, Proc. of
Supercomputing '94, pp. 773-782 · D. Poulsen
and P.C. Yew, Data Prefetching and Data
Forwarding in Shared-Memory Multiprocessors, Proc. of the Int'l Conf. on
Parallel Processing (ICPP), Vol. II, Aug. 1994, pp. 276-280 · D.K. Chen and P.C. Yew, Statement Reordering for Doacross Loops, Proc. of the Int'l Conf. on Parallel
Processing (ICPP), Vol.II, Aug. 1994, pp. 24-28 · D.K. Chen and P.C. Yew, Redundant Synchronization Elimination
for Doacross Loops, Prof. of 1994 Int'l
Parallel Processing Symp. (IPPS), April 1994, pp.
477-481 · P. Konas
and P.C. Yew, Improved Parallel
Architectural Simulations on Shared-Memory Multiprocessors, Proc. of the
ACM/IEEE/SCS 8th Workshop on Parallel and Distributed Simulation, July 1994. · D.J. Kuck,
et. al, The Cedar System and an Initial
Performance Study, Proc. of the 20th Symp. on Computer Architecture (ISCA-20), May 1993, pp.213-223 · D.K. Poulsen
and P.C. Yew, Execution-Driven Tools
for Parallel Simulation of Parallel Architecture and Applications, Proc.
of Supercomputing '93, Nov. 1993, pp. 860-869 · D.K. Chen and P.C. Yew, Efficient Synchronization for Doacross Loops Execution, Proc. of 1992 Int'l Conf.
on Parallel Processing (ICPP), Aug. 1992 · W.T. Hsu and P.C. Yew, The Impact of Wiring Constraints on
Hierarchical Network Performance, Proc. of the 1992 Int'l Parallel
Processing Symp. (IPPS), March, 1992, pp.580-588 · P. Konas
and P.C. Yew, Synchronous Parallel
Discrete Event Simulation on Shared-Memory Multiprocessors, Proc. of the
6th Workshop on Parallel and Distributed Simulation, Jan. 1992, pp.12-21. · H.M. Su and P.C. Yew, Efficient Doacross
Execution for Distributed Shared-Memory Systems, Proc. of Supercomputing
'91, Nov. 1991, pp.842-853 · D.K. Chen and P.C. Yew, An Empirical Study on Doacross Loops,
Proc. of Supercomputing '91, Nov. 1991, pp. 620-632 · J. Konicek,
et al, The Organization of the Cedar
System, Proc. of 1991 Int'l Conf. on Parallel Processing (ICPP), Aug.
1991, pp.49-56 · D.J. Lilja
and P.C. Yew, Combining Hardware and
Software Cache Coherence Strategies, Proc. of the 1991 Int't Conf. on Supercomputing (ICS), June 1991, pp.
274-283 · H.B. Lim and P.C. Yew, Parallel Program Behavioral Study on a
Shared-Memory Multiprocessor, Proc. of the 1991 Int'l Conf. on
Supercomputing (ICS), June, 1991, pp. 386-395 · H.M. Su and P.C. Yew, Efficient Interprocessor
Communication on Distributed Shared-Memory Multiprocessors, Proc. of the
1991 Int'l Conf. on Parallel Processing (ICPP), Vol.1, Aug. 1991, pp. 45-48 · W.T. Hsu and P.C. Yew, The Performance of Hierarchical Systems
with Wiring Constraints, Proc. of the 1991 Int'l Conf. on Parallel
Processing (ICPP), Vol. 1, Aug. 1991, pp. 9-16 · P. Konas,
P.C. Yew, Parallel Event Discrete Event
Driven Simulation on Shared-Memory Multiprocessors, Proc. of the 24th
Annual Simulation Symp.,
April, 1991, pp. 134-148. · J. Bruner, H. Cheong, A. Veidenbaum and P. C. Yew, Chief: A Parallel Simulation Environment for Parallel Systems,
Proc. of the 5th Int'l Parallel Processing Symp
(IPPS). April, 1991, pp. 568-575 · W. T. Hsu and P.C. Yew, An Effective Synchronization Network for
Large Multiprocessor Systems, Proc. of the 5th Int'l Parallel Processing Symp (IPPS). May, 1991, pp. 309-317 · D. Lilja
and P.C. Yew, Comparing Parallelism
Extraction Techniques: Superscalar Processors, Pipelined Processors and
Multiprocessors, Proc. of 1990 Int'l Conf. on Parallel Processing (ICPP),
Aug. 1990, pp. 563-564 · P. Tang, P.C. Yew and
C.Q. Zhu, Compiler Algorithms for Data
Synchronization in Nested Parallel Loops, Proc. of 1990 Int'l Conf. on
Supercomputing (ICS), June 1990, pp. 177-186 · D.K. Chen, H.M. Su and
P.C. Yew, The Impact of Synchronization
and Granularity on Parallel Systems, Proc. of 17th Int'l Symp. on Computer Architecture
(ISCA-17), June 1990, pp. 239-249 · P.C. Yew and J. Bruner, SEE: A
System Evaluation Environment for Studying Parallel Systems, Proc. of the
First Workshop on Parallel Processing, Dec. 1990. · H.M. Su and P.C. Yew, On Data Synchronization for Multiprocessors,
Proc. of the 16th Int'l Symp. on
Computer Architecture (ISCA-16), 1989,
pp 416-423 · Z. Shen,
Z. Li and P.C. Yew, An Empirical Study
on Array Subscripts and Data Dependences, Proc. of the 1989 Int'l Conf.
on Parallel Processing (ICPP), Aug. 1989, pp
145-152 · P.Y. Tang and P.C. Yew, A
Parallel Linked List for Shared-Memory Multiprocessors, Proc. of the 1989
Computer Software and Application Conf, Oct. 1989,
pp.130-135. · Z. Li, P.C. Yew and C.Q.
Zhu, Data Dependence Analysis on Multi-Dimensional Array References, Proc. of
the 1989 Int'l Conf. on Supercomputing, June 1989, pp
215-224 · P.A. Emrath,
D.A. Padua and P.C. Yew, Cedar
Architecture and Its Software, 22nd Hawaii Intn'l
Conf. on System Sciences, Jan. 1989, pp 306-315. · Z. Li and P.C. Yew, Efficient Interprocedural
Analysis for Parallel Programs, ACM SIGPLAN Symp.
on Parallel Programming: Experience with
Applications, Languages and Systems, July 1988, pp. 85-99 · P. Tang, P.C. Yew and
C.Q. Zhu, Impact of Self-Scheduling
Order on Performance of Multiprocessor Systems, Proc. of the 1988 Int'l
Conf. on Supercomputing (ICS), pp. 593-603 · Z. Li and P.C. Yew, Interprocedural Analysis for Parallel Computing,
Proc. of the 1988 Int'l Conf. on Parallel Processing (ICPP), pp. 221-228 · W.T. Hsu and P.C. Yew, A Scheme to Enhance Binary N-Cube Networks,
Proc. of the 1987 Int'l Conf. on Parallel Processing (ICPP), pp. 820-823 · R.L. Lee, P.C. Yew and
D.H. Lawrie, Data
Prefetching in Shared Memory Multiprocessors, Proc. of the 1987 Int'l
Conf. on Parallel Processing (ICPP), pp. 28-31 · Z. Fang, P.Y. Tang and
P.C. Yew, C.Q. Zhu, Dynamic Processor
Self-Scheduling for General Parallel Nested Loops, Proc. of the 1987
Int'l Conf. on Parallel Processing (ICPP), pp. 1-10 · R.L. Lee, P.C. Yew and
D.H. Lawrie, Multiprocessor
Cache Design Considerations, Proc. of the 14th Int'l Symp.
on Computer Architecture (ISCA-14), pp. 253-262,
1987 · P.Y. Tang and P.C. Yew, Deadlock Prevention in Processor
Self-Scheduling for Nested Parallel Loops, Proc. of the 1987 Int'l Conf.
on Parallel Processing (ICPP), pp. 11-18, 1987 · N.F. Tzeng,
P.C. Yew and C.Q. Zhu, Fault-Diagnosis
in a Multiple-Path Interconnection Networks, Proc. of the 16th Int'l Symp.on Fault-Tolerance Computing, pp.98-103, July 1986 · P.Y. Tang and P.C. Yew, Processor Self-Scheduling for
Multiple-Nested Parallel Loops, Proc. of the 1986 Int'l Conf. on Parallel
Processing (ICPP), St. Charles, IL., pp. 528-535,
Aug. 1986 · N.F. Tzeng,
P.C. Yew and C.Q. Zhu, A Fault-Tolerant
Scheme for Multistage Interconnection Networks, Proc. of the 12th Int'l Symp. on Computer Architecture
(ISCA-12), pp. 368-375, June 1985 · N.F. Tzeng,
P.C. Yew and C.Q. Zhu, The Performance
of A Fault-Tolerant Multistage Interconnection Network, Proc. of the 1985
Int'l Conf. on Parallel Processing (ICPP), pp. 458-465, Aug. 1985 · C.Q. Zhu and P.C. Yew, A Synchronization Scheme and Its
Applications for Large Multiprocessor Systems, Proc. of the 4th Int'l
Conf. on Distributed Computing Systems, pp. 486-493, May 1984. · Q.X. Xu
and P.C. Yew, Simulations and Analysis
for a Multiprocessor System with Multiprogramming, Proc. of the First
Int'l Conf. on Computers and Applications, June 1984. · P.Y. Chen, P.C. Yew and
D.H. Lawrie, Performance
of Packet Switching in a Buffered Single-Stage Shuffle-Exchange Network,
Proc. of the 3rd Int'l Conf. on Distributed Computing Systems, pp. 622-629,
Oct., 1982. · W. Abu-Sufah, R. Lee, M. Malkawi and
P.C. Yew, Experimental Results on the
Paging Behavior of Numerical Programs, Proc. of the 6th Int'l Conf. on Software
Engineering (ICSE), pp. 110-117, Sept., 1982. · J.E. Lilienkamp,
D.H. Lawrie and P.C. Yew, A Fault Tolerant Interconnection Networks Using Error Correcting
Codes, Proc. of the 1982 Int'l Conf. on Parallel Processing (ICPP),
pp.123-125, Aug. 1982 |