Pen-Chung Yew

 

Professor

Department of Computer Science and Engineering

University of Minnesota at Twin Cities

4-192 Keller Hall

200 Union Street, SE

Minneapolis, MN 55455, USA

Contact: <yew> {at} cs [dot] umn [dot] edu

Education

Ph.D. 1981

   University of Illinois at Urbana-Champaign, Computer Science.

M.S. 1977

University of Massachusetts at Amherst, Computer Engineering

B.S. 1972

National Taiwan University, Electrical Engineering

 

Publications (in chronological order, updated 9/1/2017)

A. Journal Papers

 

      C. Wu, Z. Wang, X. Yuan, Z. Wang, L. Li, P. C. Yew, J. Huang, X. Feng, Y. Lan, Y. Chen, Y. Lai, Y. Guan, Using Local Clocks to Reproduce Concurrency Bugs, IEEE Trans. on Software Engineering (TSE), to appear.

      W. Zhang, X., Ji, Y. Lu, H. Wang, H. Chen, P.C. Yew, Prophet: A Parallel Instruction-Oriented Many-Core Simulator, IEEE Transactions on Parallel and Distributed Systems (TPDS), to appear.

      W. Zhang, X. Ji, S. Yu, H. Chen, T. Li and P.C. Yew, VarCatcher: A Framework for Tackling Performance Variability of Parallel Workloads on Multicores, IEEE Transactions on Parallel and Distributed Systems (TPDS), April 2017.

      S. Mehta and P.C. Yew, Variable Liberalization, ACM Transactions on Architecture and Code Optimization (TACO), to appear in 2016

       Z. Fang, S. Mehta, P.C. Yew, A. Zhai, J. Greensky, G. Beeraka, B. Zang, Measuring Micro-architectural Details of Multi- and Many-core Memory Systems Through Micro-benchmarking, ACM Transactions on Architecture and Code Optimization (TACO), Vol.11, Issue 4, January 2015.

      F. Lv, L. Liu, M.H. Cui, L. Wang, Y. Liu, X. Feng, P.C. Yew, WiseThrottling: A New Asynchronous Task Scheduler for Mitigating I/O Bottleneck in Large-Scale Datacenter Servers, J. of Supercomputing, 2015

      A. Holey, V. Mekkat, P.C. Yew, A. Zhai, Performance-Energy Considerations for Shared Cache Management in a Heterogeneous Multicore Processor, ACM Transactions on Architecture and Code Optimization (TACO), Vol 12, Issue 1, March 2015.

      C. Wu, J. Li, D. Xu, P.C. Yew, J. Li, and Z. Wang, FPS: A Fair-progress Process Scheduling Policy on Shared-Memory Multiprocessors, IEEE Transactions on Parallel and Distributed Systems (TPDS), Vol. 26, No. 2, February 2015, pp. 444-454

      Lv, H.M. Cui, L. Wang, L. Liu, C.G. Wu, X.B. Feng and P.C. Yew, Dynamic I/O-Aware Scheduling for Batch-Mode Applications on Chip Multiprocessor Systems of Cluster Platform, J. of Computer Science and Technology (JCST), 29(1): 21-37, 2014

      D.Y. Hong, J.J. Wu, P.C. Yew, W.C. Hsu, C.C. Hsu, P. Liu, C.M. Wang and Y.C. Chung, Efficient and Retargetable Dynamic Binary Translation on Multicores, IEEE Transactions on Parallel and Distributed Systems (TPDS), Vol. 25, No. 3, March 2014, pp. 622-632

       S. Mehta, G. Beeraka and P.C. Yew, Tile Size Selection Revisited, ACM Transactions on Architecture and Code Optimization (TACO), Vol. 10, No. 4, December 2013

      L. Gao, L. Li, J.L. Xue and P.C. Yew, SEED: A Statically-Greedy and Dynamically-Adaptive Approach for Speculative Loop Execution, IEEE Transaction on Computers (TC), Vol. 62, No. 5, May 2013, pp. 1004-1016

      S.Y. Wang, P.C. Yew and A. Zhai, Code Transformations for Enhancing the Performance of Speculatively Parallel Threads, J. of Circuits, Systems and Computers (JCSC), No. 2, Vol. 21, 2012

      Z. Wang, C. Wu, P.C. Yew, J.J. Li and X. Di, On-the-Fly Structure Splitting for Heap Objects, ACM Transactions on Architecture and Code Optimization (TACO), January 2012

       H. Chen, R. Chen, F. Zhang, B. Zang and P.C. Yew, Mercury: combining performance with dependability using self-virtualization, Journal of Computer Science and Technology (JCST), 2012.

      H. Chen, J. Wu, C. Huang, P.C. Yew and B. Zang, Dynamic Software Updating Using a Relaxed Consistency Model, IEEE Trans. on Software Engineering (TSE), Vol. 37, No. 5, Sept/Oct 2011, pp. 679-694

      P. Woodward, J. Jayaraj, P.H. Lin, P.C. Yew, Moving Scientific Codes to IBM Cell Processor and Other Multicore Microprocessor CPUs, IEEE Computing in Science and Engineering, Vol.10, No.6, pp.16-25, Nov/Dec. 2008

      S.V. Kodakara, J. Kim, D.J. Lilja, D. Hawkins, W.C. Hsu, and P.C. Yew, CIM: A Reliable Metric for Evaluating Program Phase Classifications, IEEE Computer Architecture Newsletter (CAN), 2007

      J. Lin, W.C. Hsu, P.C. Yew, R.D.C. Ju, and T.F. Ngai, Recovery Code Generation for General Speculative Optimizations, ACM Transactions on Architecture and Code Optimization (TACO), Vol.3, No.1, March 2006, pp. 67-89

      J.S. Kong, P.C. Yew and G.H. Lee, Minimizing the Directory Size for Large-Scale Shared-Memory Multiprocessors, IEICE Trans. on Information and Systems, Vol. E88-D No.11 2533, November 2005, pp. 2533-2543.

      J. Lin, T. Chen, W.C. Hsu, P.C. Yew, R.D.C. Ju, T.F. Ngai and S. Chan, A Compiler Framework for Speculative Optimizations, ACM Transactions on Architecture and Code Optimization (TACO), Vol.1, No.3, September 2004, pp. 247-271

      J. Lu, H. Chen, P.C. Yew, W.C. Hsu, Design and Implementation of a Lightweight Dynamic Optimization System, Journal of Instruction-Level Parallelism, Volume 6, 2004

      P.Y. Tang and P.C. Yew, Interprocedural Induction Variable Analysis, International Journal of Foundation of Computer Science, World Scientific, Vol.14, No.3, June 2003, pp.405-423

      S.J. Lee and P.C. Yew, On Augmenting Trace Cache for High-Bandwidth Value Prediction, IEEE Tran. on Computers (TC), Vol.51, No. 9, September 2002, pp. 1074-1088.

      S.J. Lee and P.C. Yew, On Table Bandwidth and Its Update Delay for Value Prediction on Wide-Issue ILP Processors, IEEE Transactions on Computers (TC), Vol. 50, No.8, August 2001, pp.847-852.

      H.B. Lim and P.C. Yew, Efficient Integration of Compiler-Directed Cache Coherence and Data Prefetching, Journal of Parallel and Distributed Computing (JPDC), Vol. 61, No. 12, Dec 2001, pp. 1775-1802

      S.Y. Cho, P.C. Yew and G. Lee, A High-Bandwidth Memeory Pipeline for Wide-Issue Processors, IEEE Trans. on Computers (TC), Vol. 50, No.7 , July 2001, pp. 709-723.

      L. Choi and P.C. Yew, Compiler Analysis for Cache Coherence: Interprocedural Array Data-Flow Analysis and Its Impact on Cache Performance, IEEE Trans. on Parallel and Distributed Systems (TPDS), Vol. 11, No. 9, Sept 2000, pp. 879-896.

      L. Choi and P.C. Yew, Hardware and Compiler-Directed Cache Coherence in Large-Scale Multiprocessors, the IEEE Trans. on Parallel and Distributed Systems (TPDS), Vol. 11, No. 4, April 2000, pp. 375-394.

      I.H. Kazi, et al., JaViz: A Client/Server Java Profiling Tool, a special issue on Java technology in IBM Systems Journal, Vol. 39, No.1.1, 2000.

      J.Y. Tsai, et al., The Superthreaded Architecture, a special issue on multithreaded architectures in the IEEE Trans. on Computers (TC), Vol 48, No. 9, September 1999, pp. 881-903.

      D.K. Chen and P.C. Yew, Redundant Synchronization Elimination for Doacross Loops, IEEE Trans. on Parallel and Distributed Systems (TPDS), Vol.10, No. 5, May 1999.

      H.B. Lim and P.C. Yew, Maintaining Cache Coherence Through Compiler-Directed Data Prefetching, Journal of Parallel and Distributed Computing (JPDC), Vol 53, No. 2, pp. 144-173, Sep 1998.

      J.Y. Tsai, Z. Jiang, and P.C. Yew, Compiler Techniques for the Superthreaded Architectures, a Special Issue on Languages and Compilers for Parallel Computing, International Journal of Parallel Programming, June 1998.

      J.Y. Tsai, P.C. Yew, et al, Integrating Parallelizing Compilation Technology and Processor Architecture for Cost-Effective Concurrent Multithreading, a special issue in Journal of Information Science and Eng, No. 14, pp.205-222, March 1998

      S. Adve, et al, The Interaction of Architecture and Compilation Technology for High-Performance Processor Design, IEEE Computers, December 1997

      W.T. Hsu and P.C. Yew, Performance Evaluation of Wire-Limited Hierarchical Networks, Journal of Parallel and Distributed Computer (JPDC), Vol. 41, June 1997, pp 156-172.

      J.Y. Tsai and P.C. Yew, Enhancing Multiple-Path Speculative Execution with Predicate Window Shifting, a special issue on Microprocessor Architecture in Journal of System Architecture, June 1997

      L. Choi, H.B. Lim and P.c. Yew, Multiprocessor Cache Coherence: The Compiler-Directed Approach, IEEE Parallel & Distributed Technology, Winter 1996, pp.23-35

      D.K. Poulsen and P.C. Yew, Integrating Fine-Grained Message Passing in Cache Coherent Shared-Memory Multiprocessors, Journal of Parallel and Distributed Computing (JPDC), Vol. 33, No. 2, March 1996, pp. 172-188.

      D.K. Chen and P.C. Yew, On Effective Execution of Non-Uniform Doacross Loops, IEEE Trans. on Parallel and Distributed Systems (TPDS), Vol. 7, No. 5, May 1996, pp. 463-476.

      J.D. Bruner, C.J. Beckmann, P. Konas, D.K. Poulsen and P.C. Yew, Chief: A Simulation Environment for Studying Parallel Systems, International Journal of Computer Simulation, Vol.6, No.1, 1996, pp. 89-100.

      D.J. Lilja and P.C. Yew, Improving Memory Utilization in Cache Coherence Directories, IEEE Trans. on Parallel and Distributed Systems (TPDS), Vol. 4, No.10, Oct. 1993, pp. 1130-1146.

      W.T. Hsu and P.C. Yew, An Effective Synchronization Network for Hot Spot Accesses, ACM Trans. on Computing Systems (TOCS), Vol. 10, No.3, Aug. 1992, pp. 167-189.

      Z. Shen, Z. Li and P.C. Yew, An Empirical Study on Program Characteristics for Parallelizing Compilers, IEEE Trans. on Parallel and Distributed Systems (TPDS), Vol. 1, No. 3, July 1990, pp. 356-364.

      Tim Davis and P.C. Yew, A Stable Non-Deterministic Parallel Algorithm for General Unsymmetric Sparse LU Factorization, SIAM J. on Matrix Analysis and Applications, Vol. 2, No. 3, July 1990, pp. 383-403.

      Z. Li, P.C. Yew and C.Q. Zhu, An Efficient Data Dependence Analysis for Parallelizing Compiler, IEEE Trans. on Parallel and Distributed Systems (TPDS), Vol. 1, No. 1, Jan. 1990, pp. 26-34.

      Z. Fang, P. Tang, P.C. Yew and C.Q. Zhu, Dynamic Processor Self-Scheduling for General Parallel Nested Loops, IEEE Trans. on Computers (TC), Vol. 39, No. 7, July 1990, pp. 919-929.

      P. Tang, P.C. Yew, Software Combining Algorithms for Distributing Hot-Spot Addressing, J. of Parallel and Distributed Computing (JPDC), Vol. 10, No.2, Oct. 1990, pp. 130-139.

      N.F. Tzeng and P.C. Yew and C.Q. Zhu, Realizing Fault-Tolerant Interconnection Networks via Chaining, IEEE Trans. on Computers (TC), Vol. 37, No. 4, pp. 458-462, April 1988.

      Z. Li and P.C. Yew, Program Parallelization with Interprocedural Analysis, J. of Supercomputing, Kluwer Academic Publishers, 1988, pp. 225-244.

      C.Q. Zhu and P.C. Yew, A Scheme to Enforce Data Dependence on Large Multiprocessor Systems, IEEE Trans. on Software Engineering (TSE), Vol. SE-13, No. 6, pp. 726-739, June 1987.

      P.C. Yew, N.F. Tzeng and D.H. Lawrie,, Distributing Hot Spot Addressing in Large Scale Multiprocessors, IEEE Trans. on Computers (TC), Vol. C-36, No. 4, pp. 388-395, April 1987.

      P.C. Yew, D.A. Padua and D.H. Lawrie,, Stochastic Properties of a Multiple-Layer Single-Stage Shuffle-Exchange Network in a Message Switching Environment, J. of Digital Systems, Vol. 6, No. 4, pp. 387-410, 1982.

      P.Y. Chen, D.H. Lawrie, P.C. Yew and D.A. Padua, Interconnection Networks Using Shuffle, IEEE Computer, Vol. 14, No. 12, pp. 55-64, December 1981.

      P.C. Yew and D.H. Lawrie, An Easily Controlled Network for Frequently Used Permutations, IEEE Trans. on Computers (TC), Vol. C-30, No.4, pp. 296-298, April 1981.

 

B. Refereed Conference Papers

 

  W. Wang, P.C. Yew, A. Zhai, S. McCamant, Y. Wu, J. Bobba, Enabling Cross-ISA Offloading for COTS Binaries, The 15th ACM International Conf. on Mobile, Systems, Applications, and Services (MobiSys), June 2017

  W. Wang, A. Zhai and P.C. Yew, A General Persistent Code Caching Framework for Dynamic Binary Translation, Proc. of the 2016 USENIX Annual Technical Conference (ATC), June 2016

  S. Mehta, R. Garg, N. Trivedi and P.C. Yew, TurboTiling: Leveraging Prefetching to Boost Performance of Tiled Codes, Proc. of the 2016 Int't Conf. on Supercomputing (ICS), June 2016.

  C.J. Chang, Y.C. Peng, C.C. Chen, T.F. Chen and P.C. Yew, Adaptive Granularity and Coordinated Management for Timely Prefetching in Multi-core Systems, 2015 International Symposium on VLSI Design, Automation and Test (VLSI-DAT), May 28 2015

  S. Mehta and P.C. Yew, Improving Compiler Scalability: Optimizing Programs at Small Price, Proc. of ACM SIGPLAN Int値 Conf. on Programming Languages Design and Implementation (PLDI), June 2015

  X. Yuan, C. Wu, Z. Wang, J. Li, X. Feng, P.C. Yew, Y. Lan, Y. Chen, J. Huang, Y. Guan, Reproducing Concurrency Bugs using Local Clocks, Proc. of Int'l Conf. on Software Engineering (ICSE), May, 2015

  W. Wang, C. Wu, P.C. Yew, X. Shen, X. Yuan, Z. Wang, J. Li, X. Feng, Localization of Concurrency Bugs Using Shared Memory Access Pairs, 29th IEEE/ACM International Conference on Automated Software Engineering (ASE), September 2014

  S. Mehta, Z. Fang, A. Zhai and P.C. Yew, Multistage Coordinated Prefetching for Present-Day Processors, Proc. of the 2014 Int't Conf. on Supercomputing (ICS), June 2014

  C.F. Chen, C.C. Chen, et al, DAPs: Dynamic Adjustment and Partial Sampling for Multithreaded/Multicore Simulation, Proc. of 51th International Design Automation Conference (DAC), June 2014

  Y.H. Lu, D.Y. Hong, T.Y. Wu, J.J. Wu, P. Liu, W.C. Hsu, and P.C. Yew, DBILL: An Efficient and Retargetable Dynamic Binary Instrumentation Framework using LLVM Backend, Proc. of 10th Int'l Conf. on Virtual Execution Environments (VEE), March 2014

  C.R. Chang, J.J. Wu, P. Liu, W.C. Hsu, and P.C. Yew, Efficient Memory Virtualization for Cross-ISA System Mode Emulation, Proc. of 10th Int'l Conf. on Virtual Execution Environments (VEE), March 2014

  S. Mehta, P.H. Lin, and P.C. Yew, Revisiting Loop Fusion in the Polyhedral Framework, Proc. of ACM SIGPLAN 19th Annual Symp. on Principles and Practice of Parallel Programming (Ppopp), February 2014

  S.H. Chen, S.M. Lin, K.Y. Chen, Y.H. Chang, P.C. Yew, C.C. Ho, A Systematic Methodology for OS Benchmarks Characterization, Proc. of ACM Int値 Conf. on Reliable and Convergent Systems (RACS), October 2013

  V. Mekkat, A. Holey, P.C. Yew and A. Zhai, Managing Last-Level Cache in a Heterogeneous Multicore Processor, Proc. of Int'l Conf. on Parallel Architectures and Compiler Techniques (PACT), September 2013.

  X. Yuan, C. Wu, P.C. Yew, W. Wang, Z. Wang, J. Li and D. Xu, Synchronization Identification through On-the-Fly Test, Proc. of 2013 Euro-Par Conference (Euro-Par), August 2013

  C.C. Hsu, J.J. Wu, P.C. Yew, D.Y. Hong, C.M. Wang, and W.C. Hsu, Improving Dynamic Binary Optimization Through Early-Exit Guided Code Region Formation, 9th Int'l Conf on Virtual Execution Environments (VEE), March 2013

  P.H. Lin, J. Jayaraj, P. Woodward, and P.C. Yew, A Study of Performance Portability Using Piecewise-Parabolic Method (PPM) Gas Dynamics Applications, Proc. of Int値 Conf. on Computational Science (ICCS), May 2012

  D. Xu, C. Wu, P.C. Yew, J. Li, and Z. Wang, Providing Fairness on Shared Memory Multiprocessors via Process Scheduling, ACM SIGMETRICS Performance, June 2012

  D.Y. Hong, C.C. Hsu, P.C. Yew, J.J. Wu, W.C. Hsu, Y.C. Chung, P. Liu and C.M. Wang, HQEMU: A Multi-Threaded and Retargetable Dynamic Binary Translator on Multicores, Proc. of the 10th Annual IEEE/ACM Int'l Symp. on Code Generation and Optimization (CGO), March, 2012

  C.C. Hsu, P. Liu, C.M. Wang, J.J. Wu, D.Y. Hong, P.C. Yew and W.C. Hsu, LnQ: Building High Performance Dynamic Binary Translators with Existing Compiler Backends, Proc. of the 40th International Conference on Parallel Processing (ICPP), Taipei, Taiwan, September 2011

  D. Xu, C. Wu and P.C. Yew, On Mitigating Memory Bandwidth Contention Through Bandwidth-Aware Scheduling, Proc. of Int'l Conf. on Parallel Architectures and Compiler Techniques (PACT), September 2010.

   P. Woodward, et al, Boosting the Performance of Computational Fluid Dynamics Codes for Interactive Supercomputing, Proc. Of Int値 Conf. on Computational Science (ICCS), May 2010

  J. Lin and P.C. Yew, A Compiler Framework for General Memory Layout Optimization Targeting Structures and Arrays, The 12th Annual Workshop on the Interaction between Compilers and Computer Architecture (INTERACT), March 2010.

  Z. Wang, C. Wu and P.C. Yew, On Improving Heap Memory Layout by Dynamic Pool Allocation, Proc. of the 8th Annual IEEE/ACM Int'l Symp. on Code Generation and Optimization (CGO), April, 2010.

  L. Wang, et. al., An Adaptive Task Creation Strategy for Work-Stealing Scheduling, Proc. of the 8th Annual IEEE/ACM Int'l Symp. on Code Generation and Optimization (CGO), April, 2010.

  H. Chen, L. Yuan, X. Wu, B. Zang, B. Huang, P.C. Yew, Control Flow Obfuscation with Information Flow Tracking, Proc. of the 42nd Int'l Symp. on Microarchitecture (MICRO-42), November 2009

   V. Packirisamy, A. Zhai, W.C. Hsu, T.F. Ngai, P.C. Yew, Exploring Speculative Parallelism in SPEC2006, Proc. of IEEE Int値 Symp. On Performance Analysis of Systems and Software (ISPASS), April 2009

  Y. Duan, X. Feng, P.C. Yew, Detecting and Eliminating Violation of Sequential Consistency for Concurrent C/C++ Programs, Proc. of IEEE/ACM Int'l Symp. on Code Generation and Optimization (CGO), March 2009

  (Best Paper Award) V. Packirisamy, Y. Luo, W.L. Hung, A. Zhai and P.C. Yew, Efficiency of Thread-Level Speculation in SMT and CMP Architectures Performance, Power and Thermal Perspective, Proc. Of Int値 Conf. on Computer Design (ICCD), Oct. 2008.

  H. Chen, X. Wu, L. Yuan, B. Zang, P.C. Yew, F.T. Chong, From Speculation Security: Practical and Efficient Information Flow Tracking Using Speculative Hardware, Proc. of 35th Int'l Symp. on Computer Architecture (ISCA-35), June 2008

  G.J. He, A. Zhai and P.C. Yew, Ex-Mon: An Architectural Framework for Dynamic Program Monitoring on Multicore Processors, The 12th Annual Workshop on the Interaction between Compilers and Computer Architecture (INTERACT), Feb 2008.

  (Best Paper Award) H. Chen, R. Chen, F. Zhang, B. Zang and P.C. Yew, Mercury: combining performance with dependability using self-virtualization, Proc. of Int値 Conf. on Parallel Processing (ICPP), Sept. 2007

  S.V. Kodakara, J. Kim, W.C. Hsu, D.J. Lilja, P.C. Yew, Analysis of Statistical Sampling in Microarchitecture Simulation: Metric, Methodology and Program Characterization, Proc. of Int'l Symp. on Workload Characterization (IIWCS), Sept, 2007

  J. Kim, W.C. Hsu and P.C. Yew, COBRA: An Adaptive Runtime Binary Optimization Framework for Multithreaded Applications, Proc. of Int値 Conf. on Parallel Processing (ICPP), Sept. 2007

  S.J. Lee, H.K. Lee, and P.C. Yew, Runtime Performance Projection Model for Dynamic Power Management, Proc. of Asia-Pacific Computer Systems Architecture Conference (ACSAC), Aug. 2007

  J. Kim, S.V. Kodakara, W.C. Hsu, D.J. Lilja, R. Geva, P.C. Yew, Entropy-Based Profile Characterization and Classification for Automatic Profile Management, Proc. of Asia-Pacific Computer Systems Architecture Conference (ACSAC), Aug. 2007

  H. Chen, J. Yu, C. Rong, B.Y. Zang and P.C. Yew, POLUS: A Powerful Live Updating Systems, Proc. of Int'l Conf. on Software Engineering (ICSE), May, 2007

  R. Fu, A. Zhai, P.C. Yew and W.C. Hsu, J. Lu, Reducing Queueing Stalls Caused by Data Prefetching, The 11th Annual Workshop on the Interaction between Compilers and Computer Architecture (INTERACT) May 2007.

  V. Packirisamy, S.Y. Wang, A. Zhai, W.C. Hsu, P.C. Yew, Supporting Speculative Multithreading on Simultaneous Multithreaded Processors, in Proc. of Int'l Conf. on High Performance Computing (HiPC), Bangalore, India, Dec 2006

  S.Y. Wang, A. Zhai, P.C. Yew, Exploiting Speculative Thread-Level Parallelism in Data Compression Applications, in Proc. of 19th Workshop on Languages and Compiler for Parallel Computing (LCPC), New Orlean, LA Nov. 2006

  H.B. Chen, R. Chen, F.Z. Zhang, B.Y. Zang, P.C. Yew, Live Updating Operating Systems Using Virtualization, 2nd Int'l Conf on Virtual Execution Environments (VEE), June 2006

  J. Kim, S.V. Kodakara, W.C. Hsu, D.J. Lilja, P.C Yew, Dynamic Code Region (DCR) Based Program Phase Tracking and Prediction for Dynamic Optimizations, Lecture Notes in Computer Science, Volume 3793 (HiPEAC), Oct 2005, pp. 203 - 217.

  X. Dai, A. Zhai, W.C. Hsu and P.C. Yew, A General Compiler Framework for Speculative Optimizations Using Data Speculative Code Motion, Proc. of the Third Annual IEEE/ACM Int'l Symp. on Code Generation and Optimization (CGO), March 2005, pp. 280-290

  A. Das, J. Lu, H. Chen, J. Kim, P.C. Yew, W.C. Hsu, D.Y. Chen, Performance of Runtime Optimization on BLAST, Proc. of the Third Annual IEEE/ACM Int'l Symp. on Code Generation and Optimization (CGO), March 2005, pp. 86-96

  J. Lin, W.C. Hsu, P.C. Yew, R.D. Ju and T.F. Ngai, A Compiler Framework for Recovery Code Generation in General Speculative Optimizations, Proc. of Int'l Conf. on Parallel Architectures and Compiler Techniques (PACT), September 2004, pp. 17-28

  T. Chen, J. Lin, X. Dai, W.C. Hsu and P.C. Yew, Data Dependence Profiling for Speculative Optimizations, Proc. of 14 Int'l Conf. on Compiler Construction (CC), March 2004, pp. 57-62

  H. Chen, J. Lu, W.C. Hsu, P.C. Yew, Continuous Adaptive Object-Code Re-optimization Framework, Ninth Asia-Pacific Computer Systems Architecture Conference (ACSAC), pp. 241-255, Sept 2004.

  J. Lu, H. Chen, R. Fu, W.C. Hsu, B. Othmer and P.C. Yew, The Performance of Runtime Data Cache Prefetching in a Dynamic Optimization System, Proc. of 36th Annual Int'l Symp. on Microarchitecture (MICRO-36), December 2003

  J. Lin, T. Chen, W.C. Hsu, P.C. Yew, R.D. Ju and T.F. Ngai, A Compiler Framework for Speculative Analysis and Optimizations, Proc. of ACM/SIGPLAN Conf. on Programming Language Design and Implementation (PLDI), June 2003, pp.289-299

  H. Chen, W.C. Hsu, J. Lu, B. Othmer, D.Y. Chen, and P.C. Yew, Dynamic Trace Selection Using Performance Monitoring Hardware Sampling, Proc. of the 1st IEEE/ACM Int'l Symp. on Code Generation and Optimization (CGO), March 2003, pp. 79-90

  J. Lin, T. Chen, W.C. Hsu and P.C. Yew, Speculative Register Promotion Using Advanced Load Address Table (ALAT), Proc. of the 1st IEEE/ACM Int'l Symp. on Code Generation and Optimization (CGO), March 2003, pp. 125-134

  T. Chen, J. Lin, W.C. Hsu and P.C. Yew, An Empirical Study on the Granularity of Pointer Analysis in C Programs, Proc of the 15th Workshop on Languages and Compilers for Parallel Computing (LCPC), Aug. 2002

  W.C. Hsu, H. Chen, P.C. Yew and D.Y. Chen, On the Predictability of Program Behavior Using Different Input Data Sets, Proc. of the 6th Workshop on Interaction Between Compilers and Computer Architectures (INTERACT-6), Feb 2002.

  P.Y. Tang and P.C. Yew, Interprocedural Induction Variable Analysis, Proc. of 6th Int'l Symp. on Parallel Architectures, Algorithms and Networks (I-SPAN), pp. 245-250, May 2002.

  T. Chen, J. Lin, W.C. Hsu and P.C. Yew, An Empirical Study on the Characteristics of Heap-Oriented Pointers in C Programs, Proc. of 6th Int'l Symp. on Parallel Architectures, Algorithms and Networks (I-SPAN), pp.251-256, May 2002

  S.J. Lee and P.C. Yew, On Some Implementation Issues for Value Prediction on Wide-Issue ILP Processors, Proc. of Int'l Conf. on Parallel architectures and Compiler Techniques (PACT), Oct. 2000, pp.145-156

  S.J. Lee, Y. Wang and P.C. Yew, Decoupled Value Prediction on Trace Processors, Proc. of Int'l Conf on High-Performance Computer Architecture (HPCA-6), Jan 2000, pp.231-240

  (Best Paper Award) H.B. Lim and P.C. Yew, Efficient Integration of Compiler-Directed Cache Coherence and Data Prefetching, Proc. of the 2000 Int'l Parallel and Distributed Processing Symposium (IPDPS) (Best Paper Award), May 2000, pp. 331-342

  S.Y. Cho, P.C. Yew and G.H. Lee, Access Region Locality for High-Bandwidth Processor Memory System Design, Proc. of the 32nd Int'l Symp. on Microarchitecture (MICRO-32), Nov. 1999, pp.136-146

  S.Y. Cho and P.C. Yew, Decoupling Local Variable Accesses in a Wide-Issue Superscalar Processor, Proc. of the 26th Intn'l Symp. on Computer Architecture (ISCA-26), May 1999, pp.100-110

  B. Zheng, et. al., Designing the Agassiz Compiler for Concurrent Multithreaded Architectures, Proc. of the 12th Workshop on Languages and Compilers for Parallel Computing (LCPC-12), Aug. 1999

  J.Y. Tsai, Z. Jiang, E. Ness, and P.C. Yew, Performance of a Concurrent Multithreaded Processors, Proc. of the 4th International Symposium of High Performance Computer Architectures (HPCA-4), Feb. 1998, pp. 24-34

  S. Cho, J.Y. Tsai, et.al, High-Level Information - An Approach for Integrating Front-end and Back-end Compilers, Proc of the 1998 Int'l Conf on Parallel Processing (ICPP), Aug. 1998, pp. 346-355

  H.B. Lim and P.C. Yew, An Integrated Framework for Compiler-Directed Cache Coherence and Data Prefetching, Proc. of the 11th Workshop on Languages and Compilers for Parallel Computing (LCPC-11), Aug. 1998

  H.B. Lim, and P.C. Yew, A Compiler-Directed Cache Coherence Scheme Using Data Prefetching, Proc. of the Int'l Symp. on Parallel Processing (IPPS), Apirl 1997, pp. 643-649

  J.Y. Tsai, B. Zheng, and P.C. Yew, Program Optimization for Concurrent Multithreaded Architectures, Proc. of the 10th Workshop on Languages and Compilers for Parallel Computing (LCPC-10), Aug. 1997

  L. Choi and P.C. Yew, Compiler and Hardware Support for Cache Coherence in Large-Scale Multiprocessors: Design Considerations and Performance Evaluation, Proc. of the 23rd Int'l Symp on Computer Architecture (ISCA-23), May 1996, pp. 283-294

  L. Choi and P.C. Yew, Program Analysis for Cache Coherence: Beyond Procedural Boundaries, Proc. of the 1996 Int'l Conf. on Parallel Processing (ICPP), Aug. 1996, Vol. 3, pp. 103-114,

  L. Choi and P.C. Yew, Eliminating Stale Data References through Array Data-Flow Analysis, Proc. of the 1996 Int'l Symp. on Parallel Processing (IPPS), April, 1996, pp. 4-13

  J.Y. Tsai and P.C. Yew, The Superthreaded Architecture: Thread Pipelining for Run-Time Data Dependence Checking and Control Speculation, Proc. of the 1996 Int'l Conf. on Parallel architectures and Compiler Techniques (PACT), Oct. 1996, pp.35-46

  W.T. Hsu and P.C. Yew, Let Us Build System-Friendly Networks Build Them Hierarchically, invited paper for 1996 ICPP Workshop on Challenges for Parallel Processing, Aug 1996

  Z. Li, J.Y. Tsai, X. Wang, P.C. Yew and B. Zheng, Compiler Techniques for Concurrent Multithreading with Hardware Speculation Support, Proc. of the 9th Workshop on Languages and Compilers for Parallel Computing (LCPC-9), Aug. 1996

  H.B. Lim, L. Choi and P.C. Yew, On Using Data Prefetching for Cache Coherence in Multiprocessors, Proc. of the 9th Workshop on Languages and Compilers for Parallel Computing (LCPC-9), Aug. 1996

  P. Konas and P.C. Yew, Processor Self-Scheduling in Parallel Discrete Event Simulation, Proc. of the 1995 Winter Simulation Conference, December 1995.

  L. Choi and P.C. Yew, Interprocedural Array Data-Flow Analysis for Cache Coherence, Eighth Workshop on Languages and Compilers for Parallel Computing (LCPC-8), August 1995.

  P. Konas and P.C. Yew, Partitioning for Synchronous Parallel Simulation, Proc. of the ACM/IEEE/SCS 9th Workshop on Parallel and Distributed Simulation, 1995

  D.K. Chen, J. Torrellas and P.C. Yew, An Efficient Algorithm for the Run-Time Parallelization of Doacross Loops, Proc. of Supercomputing '94, pp. 518-527

  L. Choi and P.C. Yew, A Compiler-Directed Cache Coherence Scheme with Improved Intertask Locality, Proc. of Supercomputing '94, pp. 773-782

  D. Poulsen and P.C. Yew, Data Prefetching and Data Forwarding in Shared-Memory Multiprocessors, Proc. of the Int'l Conf. on Parallel Processing (ICPP), Vol. II, Aug. 1994, pp. 276-280

  D.K. Chen and P.C. Yew, Statement Reordering for Doacross Loops, Proc. of the Int'l Conf. on Parallel Processing (ICPP), Vol.II, Aug. 1994, pp. 24-28

  D.K. Chen and P.C. Yew, Redundant Synchronization Elimination for Doacross Loops, Prof. of 1994 Int'l Parallel Processing Symp. (IPPS), April 1994, pp. 477-481

  P. Konas and P.C. Yew, Improved Parallel Architectural Simulations on Shared-Memory Multiprocessors, Proc. of the ACM/IEEE/SCS 8th Workshop on Parallel and Distributed Simulation, July 1994.

  D.J. Kuck, et. al, The Cedar System and an Initial Performance Study, Proc. of the 20th Symp. on Computer Architecture (ISCA-20), May 1993, pp.213-223

  D.K. Poulsen and P.C. Yew, Execution-Driven Tools for Parallel Simulation of Parallel Architecture and Applications, Proc. of Supercomputing '93, Nov. 1993, pp. 860-869

  D.K. Chen and P.C. Yew, Efficient Synchronization for Doacross Loops Execution, Proc. of 1992 Int'l Conf. on Parallel Processing (ICPP), Aug. 1992

  W.T. Hsu and P.C. Yew, The Impact of Wiring Constraints on Hierarchical Network Performance, Proc. of the 1992 Int'l Parallel Processing Symp. (IPPS), March, 1992, pp.580-588

  P. Konas and P.C. Yew, Synchronous Parallel Discrete Event Simulation on Shared-Memory Multiprocessors, Proc. of the 6th Workshop on Parallel and Distributed Simulation, Jan. 1992, pp.12-21.

  H.M. Su and P.C. Yew, Efficient Doacross Execution for Distributed Shared-Memory Systems, Proc. of Supercomputing '91, Nov. 1991, pp.842-853

  D.K. Chen and P.C. Yew, An Empirical Study on Doacross Loops, Proc. of Supercomputing '91, Nov. 1991, pp. 620-632

  J. Konicek, et al, The Organization of the Cedar System, Proc. of 1991 Int'l Conf. on Parallel Processing (ICPP), Aug. 1991, pp.49-56

  D.J. Lilja and P.C. Yew, Combining Hardware and Software Cache Coherence Strategies, Proc. of the 1991 Int't Conf. on Supercomputing (ICS), June 1991, pp. 274-283

  H.B. Lim and P.C. Yew, Parallel Program Behavioral Study on a Shared-Memory Multiprocessor, Proc. of the 1991 Int'l Conf. on Supercomputing (ICS), June, 1991, pp. 386-395

  H.M. Su and P.C. Yew, Efficient Interprocessor Communication on Distributed Shared-Memory Multiprocessors, Proc. of the 1991 Int'l Conf. on Parallel Processing (ICPP), Vol.1, Aug. 1991, pp. 45-48

  W.T. Hsu and P.C. Yew, The Performance of Hierarchical Systems with Wiring Constraints, Proc. of the 1991 Int'l Conf. on Parallel Processing (ICPP), Vol. 1, Aug. 1991, pp. 9-16

  P. Konas, P.C. Yew, Parallel Event Discrete Event Driven Simulation on Shared-Memory Multiprocessors, Proc. of the 24th Annual Simulation Symp., April, 1991, pp. 134-148.

  J. Bruner, H. Cheong, A. Veidenbaum and P. C. Yew, Chief: A Parallel Simulation Environment for Parallel Systems, Proc. of the 5th Int'l Parallel Processing Symp (IPPS). April, 1991, pp. 568-575

  W. T. Hsu and P.C. Yew, An Effective Synchronization Network for Large Multiprocessor Systems, Proc. of the 5th Int'l Parallel Processing Symp (IPPS). May, 1991, pp. 309-317

  D. Lilja and P.C. Yew, Comparing Parallelism Extraction Techniques: Superscalar Processors, Pipelined Processors and Multiprocessors, Proc. of 1990 Int'l Conf. on Parallel Processing (ICPP), Aug. 1990, pp. 563-564

  P. Tang, P.C. Yew and C.Q. Zhu, Compiler Algorithms for Data Synchronization in Nested Parallel Loops, Proc. of 1990 Int'l Conf. on Supercomputing (ICS), June 1990, pp. 177-186

  D.K. Chen, H.M. Su and P.C. Yew, The Impact of Synchronization and Granularity on Parallel Systems, Proc. of 17th Int'l Symp. on Computer Architecture (ISCA-17), June 1990, pp. 239-249

  P.C. Yew and J. Bruner, SEE: A System Evaluation Environment for Studying Parallel Systems, Proc. of the First Workshop on Parallel Processing, Dec. 1990.

  H.M. Su and P.C. Yew, On Data Synchronization for Multiprocessors, Proc. of the 16th Int'l Symp. on Computer Architecture (ISCA-16), 1989, pp 416-423

  Z. Shen, Z. Li and P.C. Yew, An Empirical Study on Array Subscripts and Data Dependences, Proc. of the 1989 Int'l Conf. on Parallel Processing (ICPP), Aug. 1989, pp 145-152

  P.Y. Tang and P.C. Yew, A Parallel Linked List for Shared-Memory Multiprocessors, Proc. of the 1989 Computer Software and Application Conf, Oct. 1989, pp.130-135.

  Z. Li, P.C. Yew and C.Q. Zhu, Data Dependence Analysis on Multi-Dimensional Array References, Proc. of the 1989 Int'l Conf. on Supercomputing, June 1989, pp 215-224

  P.A. Emrath, D.A. Padua and P.C. Yew, Cedar Architecture and Its Software, 22nd Hawaii Intn'l Conf. on System Sciences, Jan. 1989, pp 306-315.

  Z. Li and P.C. Yew, Efficient Interprocedural Analysis for Parallel Programs, ACM SIGPLAN Symp. on Parallel Programming: Experience with Applications, Languages and Systems, July 1988, pp. 85-99

  P. Tang, P.C. Yew and C.Q. Zhu, Impact of Self-Scheduling Order on Performance of Multiprocessor Systems, Proc. of the 1988 Int'l Conf. on Supercomputing (ICS), pp. 593-603

  Z. Li and P.C. Yew, Interprocedural Analysis for Parallel Computing, Proc. of the 1988 Int'l Conf. on Parallel Processing (ICPP), pp. 221-228

  W.T. Hsu and P.C. Yew, A Scheme to Enhance Binary N-Cube Networks, Proc. of the 1987 Int'l Conf. on Parallel Processing (ICPP), pp. 820-823

  R.L. Lee, P.C. Yew and D.H. Lawrie, Data Prefetching in Shared Memory Multiprocessors, Proc. of the 1987 Int'l Conf. on Parallel Processing (ICPP), pp. 28-31

  Z. Fang, P.Y. Tang and P.C. Yew, C.Q. Zhu, Dynamic Processor Self-Scheduling for General Parallel Nested Loops, Proc. of the 1987 Int'l Conf. on Parallel Processing (ICPP), pp. 1-10

  R.L. Lee, P.C. Yew and D.H. Lawrie, Multiprocessor Cache Design Considerations, Proc. of the 14th Int'l Symp. on Computer Architecture (ISCA-14), pp. 253-262, 1987

  P.Y. Tang and P.C. Yew, Deadlock Prevention in Processor Self-Scheduling for Nested Parallel Loops, Proc. of the 1987 Int'l Conf. on Parallel Processing (ICPP), pp. 11-18, 1987

  N.F. Tzeng, P.C. Yew and C.Q. Zhu, Fault-Diagnosis in a Multiple-Path Interconnection Networks, Proc. of the 16th Int'l Symp.on Fault-Tolerance Computing, pp.98-103, July 1986

  P.Y. Tang and P.C. Yew, Processor Self-Scheduling for Multiple-Nested Parallel Loops, Proc. of the 1986 Int'l Conf. on Parallel Processing (ICPP), St. Charles, IL., pp. 528-535, Aug. 1986

  N.F. Tzeng, P.C. Yew and C.Q. Zhu, A Fault-Tolerant Scheme for Multistage Interconnection Networks, Proc. of the 12th Int'l Symp. on Computer Architecture (ISCA-12), pp. 368-375, June 1985

  N.F. Tzeng, P.C. Yew and C.Q. Zhu, The Performance of A Fault-Tolerant Multistage Interconnection Network, Proc. of the 1985 Int'l Conf. on Parallel Processing (ICPP), pp. 458-465, Aug. 1985

  C.Q. Zhu and P.C. Yew, A Synchronization Scheme and Its Applications for Large Multiprocessor Systems, Proc. of the 4th Int'l Conf. on Distributed Computing Systems, pp. 486-493, May 1984.

  Q.X. Xu and P.C. Yew, Simulations and Analysis for a Multiprocessor System with Multiprogramming, Proc. of the First Int'l Conf. on Computers and Applications, June 1984.

  P.Y. Chen, P.C. Yew and D.H. Lawrie, Performance of Packet Switching in a Buffered Single-Stage Shuffle-Exchange Network, Proc. of the 3rd Int'l Conf. on Distributed Computing Systems, pp. 622-629, Oct., 1982.

  W. Abu-Sufah, R. Lee, M. Malkawi and P.C. Yew, Experimental Results on the Paging Behavior of Numerical Programs, Proc. of the 6th Int'l Conf. on Software Engineering (ICSE), pp. 110-117, Sept., 1982.

  J.E. Lilienkamp, D.H. Lawrie and P.C. Yew, A Fault Tolerant Interconnection Networks Using Error Correcting Codes, Proc. of the 1982 Int'l Conf. on Parallel Processing (ICPP), pp.123-125, Aug. 1982