Selected Publications
Zhiyuan Li
Copyright and all rights therein
are retained by authors or by other copyright holders.
All persons copying this information are expected to adhere to the terms
and constraints invoked by each author's copyright. In most cases,
these works may not be reposted without the explicit permission
of the copyright holder.
2016
-
Feng Li, Zhiyuan Li, Wei Huo and Xiaobing Feng,
"Locating Software Faults Based on Minimum Debugging Frontier Set",
IEEE Trans. on Software Engineering (to appear)
( Abstract )
This preprint contains typesetting errors. For the final version check on IEEE Transactions on Software Engineering's published volumes.
-
Ye Wang, Zhiyuan Li,
"GridFOR: A Domain Specific Language for Parallel Grid-based Applications",
International Journal of Parallel Programming 44(3): 427-448 (2016).
DOI: 10.1007/s10766-014-0348-z. ( Abstract )
2015
-
Zhiyuan Li, "Author's retrospective for array privatization for parallel execution of loops", ACM International Conference on Supercomputing (ICS) 25th Anniversary, 21-23.
( Abstract )
-
Xingjing Lu, Long Chen, Zhiyuan Li, "Performance Evaluation and Enhancement of Process-Based Parallel Loop Execution",
International Journal of Parallel Programming, DOI: 10.1007/s10766-015-0394-1.
( Abstract )
2014
-
Yingchong Situ, Chandra S. Martha, Matthew E. Louis, Zhiyuan Li, Ahmed H. Sameh, Gregory A. Blaisdell, Anastasios S. Lyrintzis
"Petascale large eddy simulation of jet engine noise based
on the truncated SPIKE algorithm",
Parallel Computing, Volume 40, Issue 9, October, 2014.
2013
-
Man Wang and Zhiyuan Li.
"Global Property Violation Detection and Diagnosis for
Wireless Sensor Networks",
Proceedings of International Conference on Compilers, Architecture, and Synthesis for Embedded Systems (CASES), Sept. 29 -- Oct. 4, 2013, Montreal, Canada (ESWeek). ACM Press.
-
Yingchong Situ, Lixia Liu, Chandra Martha, Matthew Louis, Zhiyuan Li, Ahmed Sameh, Gregory Blasidell, Anatasios Lyrintzis.
"A communication-efficient linear system solver for large eddy simulation of jet engine
noise", Cluster Computing,
Volume 16, Issue 1 (2013), pp 157--170.
-
Hongtao Yu, Hou-Jen Ko, Zhiyuan Li,
"General Data Structure Expansion for Multi-threading" ACM
SIGPLAN 2013 Conference on Programming Language Design and Implementation (PLDI),
Seattle, Washington, USA
16 June 2013 -- 22 June 2013.
-
Yingchong Situ, Ye Wang, Zhiyuan Li,
"Automated rapid prototyping of regular grid-based numerical applications using generalized elemental subroutines",
27th IEEE International Parallel & Distributed Processing Symposium (IPDPS),
May 20-24, 2013,
Hyatt Regency Cambridge,
Boston, Massachusetts USA.
-
F. Li, W. Huo, C. Chen, L. Zhong, X. Feng, Z. Li,
"
Effective Fault Localization Based on Minimum Debugging Frontier Set
", IEEE/ACM International Symposium on Code Generation and Optimization (CGO)
February 23 - 27, 2013,
Shenzhen, China
2012
-
Hongtao Yu, Zhiyuan Li,
"Multi-slicing: A Compiler-Supported Parallel
Approach to Data Dependence Profiling", ACM International Symposium on Software Testing and Analysis (ISSTA 2012)
July 15th-20th, 2012, Minneapolis, Minnesota.
-
Hongtao Yu, Zhiyuan Li
"Fast Loop-level Data Dependence Profiling", 26th ACM International Conference on Supercomputing, 25-29 June 2012, San Servolo Conference Center,
San Servolo Island, Venice, Italy
2011
-
Gregory Blaisdell, Anastasios Lyrintzis, Yingchong Situ, Chandra Sekhar Martha, Matthew Louis, and Zhiyuan Li,
"Recent Advances in Large Eddy Simulations for Jet Noise Predictions", Inter-Noise 2011, Osaka, Japan, September, 2011.
-
Malcolm Owen Ng, Meng Qu, Pengxuan Zheng, Zhiyuan Li, Yin Hang,
"CO2-based Demand Controlled Ventilation under New ASHRAE Standard 62.1-2010: a case study for a Gymnasium of an Elementary School at West Lafayette, Indiana",
Energy & Buildings, 43(11), Elsevier.
-
Man Wang, Zhiyuan Li, Feng Li, Xiaobing Feng, Saurabh Bagchi and Yung-Hsiang Lu
"Dependence-based Multilevel Tracing and Replay for Wireless Sensor Networks Debugging",
(Abstract and introduction)
ACM SIGPLAN/SIGBED Conference on Languages, Compilers, Tools and Theory for Embedded Systems
(LCTES),
April 12-14, 2011, Chicago, Illinois
2010
-
Yingchong Situ, Lixia Liu, Chandra S. Martha, Matthew E. Louis,
Zhiyuan Li, Ahmed Sameh, Gregory A. Blaisdell, Anastasios S. Lyrintzis
"Reducing Communication Overhead in Large Eddy
Simulation of Jet Engine Noise",
IEEE International Conference on Cluster Computing (IEEE Cluster 2010)
September 20-24, 2010, Heraklion, Crete, Greece.
(Abstract and introduction)
-
Lixia Liu and Zhiyuan Li
"A Compiler-automated Array Compression Scheme for Optimizing
Memory Intensive Programs",
24th ACM International Conference on Supercomputing (ICS),
June 1-4, 2010, Epochal Tsukuba, Tsukuba, Japan. (Abstract and Introduction) (.pdf)
-
Lixia Liu and Zhiyuan Li
"Improving Parallelism and Locality with Asynchronous Algorithms",
15th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP). January 9-14, 2010, Bangalore, India. Presentation Slides (.pdf)
Test programs used in the experiments (.tgz)
2009
2008
- Russell Meyers and Zhiyuan Li,
"ASYNC Loop Constructs for Relaxed Synchronization"
(A preprint) , 21st
Annual International Workshop on Languages and Compilers for Parallel Computing
(LCPC), Edmonton, Alberta, Canada, July 31 - August 2, 2008 (Final paper to appear in proceedings published by Springer).
- Vinaitheerthan Sundaram, Saurabh Bagchi, Yung-Hsiang Lu, Zhiyuan Li,
"SeNDORComm: An Energy-Efficient Priority-Driven Communication Layer
for Reliable Wireless Sensor Networks"
,
27th IEEE International Symposium on Reliable Distributed Systems
(SRDS), pp. 23 --32, Napoli, Italy, October 6-8, 2008
- Lixia Liu, Zhiyuan Li, Ahmed H. Sameh, "Analyzing Memory Access Intensity in
Parallel Programs on Multicore", (Abstract in PDF) , 22nd ACM International Conference on Supercomputing (ICS),
Island of Kos, Aegean Sea, Greece, June 7-12, 2008.
(Corrigendum)
- Fang Lu, Lei Wang, Xiaobing Feng, Zhiyuan Li,Zhaoqing Zhang, "Exploiting Idle Register Classes for Fast Spill Destination", (Abstract in PDF) , 22nd ACM International Conference on Supercomputing (ICS),
Island of Kos, Aegean Sea, Greece, June 7-12, 2008.
- Changjiu Xian, Yung-Hsiang Lu, Zhiyuan Li,
"Dynamic Voltage Scaling for Multitasking Real-Time
Systems with Uncertain Execution Time",
(See Abstract)
IEEE Transactions on
COMPUTER-AIDED DESIGN of Integrated Circuits and Systems
(TCAD) , 27(8), pp. 1467 --1478, August 2008 .
2007
- Douglas Herbert,
Vinaitheerthan Sundaram,
Yung-Hsiang Lu, Saurabh Bagchi, Zhiyuan Li, Adaptive Correctness Monitoring for Wireless
Sensor Networks Using Hierarchical
Distributed Run-Time Invariant Checking ACM Transactions
on Autonomous and Adaptive Systems (TAAS) , 2(2), pp. 8:1--8:23, 2007.
- Zhiyuan Li, "Simultaneous Minimization of Capacity and Conflict Misses",
Journal of Computer Science and Technology, 22(4), pp. 497--504
- Changjiu Xian, Yung-Hsiang Lu, and Zhiyuan Li,
"A Programming Environment with Runtime Energy Characterization
for Energy-Aware Applications", International Symposium on
Low Power Electronics and Design 2007 (ISLPED) (Abstract in PDF)
- Changjiu Xian, Yung-Hsiang Lu, and Zhiyuan Li,
"Energy-Aware Scheduling for Real-Time Multiprocessor Systems with Uncertain Task Execution Time" Design Automation Conference 2007. (Abstract in PDF)
- Douglas Herbert, Vinaitheerthan Sundaram, Lila Albin, Yung-Hsiang Lu, Saurabh Bagchi, and Zhiyuan Li,
"Pervasive Carbon Dioxide and Temperature Monitoring Utilizing Large Numbers of Low-Cost Wireless Sensors", American Industrial Hygiene Conference and Exposition 2007 .
- Changjiu Xian, Yung-Hsiang Lu, and Zhiyuan Li, ``Adaptive Computation
Offloading for Energy Conservation on Battery-Powered Systems'',
International Conference on Parallel and Distributed Systems (ICPADS) 2007.
2006
- Douglas Herbert, Yung-Hsiang Lu, Saurabh Bagchi, Zhiyuan Li, Detection and Repair of Software Errors in Hierarchical Sensor Networks in Proceedings of The IEEE International Conference on Sensor Networks, Ubiquitous, and Trustworthy Computing (SUTC2006)
- Yuldi Tirta, Bennett Lau, Nipoon Malhotra, Saurabh Bagchi, Zhiyuan Li, and Yung-Hsiang Lu,
"Controlled Mobility for Efficient Data Gathering in Sensor Networks with Passively Mobile Nodes", Section 3.2, pages 92-113,
in Sensor Networks Operations, Editors: Shashi Phoha, Thomas La Porta, and Christopher Griffin.
Wiley-IEEE Press, 2006, ISBN 0-471-71976-5.
2005
- Rong Xu and Zhiyuan Li, ``A Sample-Based Cache Mapping Scheme''
(preprint in PDF) ,
in Proceedings of the ACM SIGPLAN/SIGBED 2005 Conference
on Languages, Compilers, and Tools for Embedded Systems (LCTES'05) Chicago, Illinois, June 15-17, 2005.
- Zhiyuan Li and Rong Xu.
``Impact of Cache Mapping on Memory Performance of a StrongArm Processor'',
Journal of Embedded Computing, special issue on
Cache Analysis and Optimization for Embedded Systems,
in July, 2005. (abstract in PDF)
- Yonghong Song, Cheng Wang, and Zhiyuan Li,
``A Polynomial-Time Algorithm for Memory Space Reduction'', International Journal on Parallel Programming , 33(1), Feb. 2005. (preprint in PDF)
2004
- Yuldi Tirta, Zhiyuan Li, Yung-Hsiang Lu, and Saurabh Bagchi,
``Efficient Collection of Sensor Data in Remote Fields Using
Mobile Collectors'',
13th IEEE International
Conference on Computer Communications and Networks (ICCCN'04)
October 11-13, 2004 Chicago, IL.
- Yonghong Song and Zhiyuan Li.
``Applying Array Contraction to A Sequence of DOALL Loops'',
Proceedings of International Conference on Parallel Processing,
pp. 46-53. August, 2004,
Montreal, Canada. IEEE Computer Society Press. (Preprint)
- Cheng Wang and Zhiyuan Li,
``Parametric Analysis For Adaptive Computation Offloading'' (abstract in PDF), in Proceedings of ACM SIGPLAN 2004 Conference on
Programming Language Design and Implementation (PLDI) Washington, DC, June 9-11, 2004. Copyright ACM, Inc.
- Zhiyuan Li and Yonghong Song ``Automatic Tiling of Iterative Stencil Loops'' (.pdf) (Preprint), ACM Trans. on Programming Languages and Systems 26(6), pp. 975--1028, November, 2004. Copyright ACM, Inc.
- Yonghong Song, Rong Xu, Cheng Wang and Zhiyuan Li ``Improving Data Locality by Array Contraction'' (.pdf) (Preprint), IEEE Trans. on Computers ,
53(9), pp. 1073--1084, September, 2004.
- Yonghua Ding and Zhiyuan Li, ``A Compiler Scheme for Reusing Intermediate Computation Results,'' in Proceedings of IEEE/ACM 2004 International Symposium
on Code Generation and Optimization (CGO 2004)} , March 20-24 2004, Palo Alto, CA.
- Cheng Wang and Zhiyuan Li, ``A Compiler Scheme For Computation Offloading on Wireless-Networked
Handheld Devices'' . Journal of Parallel and Distributed Computing,
64(6), pp. 740--746, June, 2004.
- Rong Xu and Zhiyuan Li, ``Using Cache Mapping to Improve Memory Performance of Handheld Devices,'', in Proceedings of The 4th IEEE International Symposium on
Performance Analysis of Systems and Software
(ISPASS-2004) , March 10 -12, 2004, Austin, Texas, USA
- Peifeng Ni and Zhiyuan Li, ``Energy Cost Analysis of IPSEC on Handheld Dev
ices'', Microprocessors and Microsystems (special issue on Secure Computing Platform)
28 (10), pp. 585--594, November, 2004.
2003
- Yonghua Ding and Zhiyuan Li, ``Compiler Analysis of Interprocedural Data Communication,'' Proceedings of IEEE/ACM Supercomputing Conference (SC2003)} ,
Nov. 2003, Phoenix, Arizona.
- R. Xu, Z. Li, C. Wang and P. Ni, ``Impact of Data Compression on Energy Consumption of
Wireless-Networked Handheld Devices'', (.ps> Proc. the 23rd IEEE International Conference on
Distributed Computing Systems (ICDCS'03),
May, 2003, Providence,
Rhode Island. IEEE Computer Society Press.
(An extended technical report: ``A Report on Impact of Data Compression on
Energy Consumption of Wireless-Networked
Handheld Devices'' (.ps/1.3Mb)
Technique Report CSD-TR-03-003,
Department of Computer Sciences,
Purdue University, West Lafayette, IN 47907,
February, 2003.)
- Z. Li,
``
Optimal Skewed Tiling for Cache Locality Enhancement
'' Abstract
(.ps) A revision of the full paper is to appear in Proc. of 17th International Parallel and Distributed Processing Symposium (Sponsored by IEEE Computer Society and ACM SIGARCH)
April, 2003, Nice, France.
IEEE Computer Society Press.
2002
- Z. Li and R. Xu,
``Energy Impact of Secure Computation on
a Handheld Device'' (.pdf/899KB) (.ps/928KB) Proc. IEEE 5th Annual Workshop on Workload Characterization
(WWC-5)}, November, 2002,
Austin, Texas.
IEEE Computer Society Press.
- Z. Li, C. Wang and R. Xu, ``Task allocation for distributed multimedia processing on
wirelessly networked handheld devices'' (preliminary version) (.ps)
A revision is in Proc. of 16th International Parallel and Distributed Processing Symposium (Sponsored by IEEE Computer Society, etc.)
April, 2002, Fort Lauderdale, Florida.
IEEE Computer Society Press.
2001
- Z. Li, C. Wang and R. Xu, ``Computation offloading
to save energy on handheld devices: A partition scheme.'' (.ps) Proc. of International Conference on Compilers,
Architectures and Synthesis for Embedded Systems,
Nov. 2001, Atlanta, Georgia, pp. 238--246, ACM Press.
- Y. Song, R. Xu, C. Wang and Z. Li, ``Data Locality Enhancement by Memory Reduction'', in Proc. of ACM 15th International Conference on Supercomputing , June, 2001, pp. 50 -- 64. Copyright 2001 by ACM, Inc. ICS01 presentation slides (.ps)
2000
1999
- Yonghong Song and Zhiyuan Li,
New Tiling Techniques to Improve Cache Temporal Locality
(.ps), in Proc. of ACM SIGPLAN Conference on Programming
Language Design and Implementation , May, 1999, pp. 215 -- 228. Copyright 1999 by ACM, Inc. PLDI99 presentation slides (.ps)
- Z. Li, Reducing Cache Conflicts by Partitioning and Privatizing Shared
Arrays (.ps) (This version corrects Table 1 in the conference
Proceedings.) In Proc. of the 1999 International Conference on Parallel
Architectures and Compilation Techniques (PACT99), IEEE Computer Society and IFIP Working Group 10.3,
October, 1999, pp. 183 -- 190.
[PACT99 presentation slides (.ps) ]
1998
- G. Jin, Z. Li and F. Chen, An Efficient Solution to the Cache
Thrashing Problem, IEEE Trans. on Computers, 47(5), May 1998, pp. 527--543.
- G. Jin, Z. Li and F. Chen.
``A theoretical foundation for program transformations to
reduce cache thrashing due to true data-sharing'' Theoretical Computer Science , 255(2), pp. 449 -- 481,
2001. Abstract and paper at the publisher's web site
- T. N. Nguyen and Z. Li, Interprocedural analysis for loop scheduling and
data allocation, , Parallel Computing, Special Issue on Languages and Compilers for Parallel Computers,
24(3), pp. 477--504, 1998.
- Z. Li, J. Huang and G. Jin, Page Mapping Techniques to Reduce Cache Conflicts
on CC-NUMA Multiprocessors, Microprocessors and Microsystems, special issue on Parallel Algorithms and
Architectures, Vol. 22, Nos. 3-4, 28 August 1998.
- S. Cho, J.-Y. Tsai, Y. Song, B. Zheng,
S. Schwinn, X. Wang, Q. Zhao, Z. Li, D. J. Lilja,
and P.-C. Yew. High-Level Information: An Approach for Integrating Front-End
and Back-End Compilers, Proceedings of International Conference on Parallel Processing, August, 1998.
- J.-Y. Tsai, Z. Jiang, Z. Li, D. J. Lilja, X. Wang,
P.-C. Yew, B. Zheng, S. J. Schwinn, and R. Glamm, Integrating Parallelizing
Compilation Technology and Processor Architecture for
Cost-Effective Concurrent Multithreading, Journal of Information Science and Engineering,
Special Issue on Compiler Techniques for High-Performance
Computing, 14(1), March 1998.
- Z. Li, J. Gu and G. Lee, Interprocedural analysis based on guarded array regions,
Chapter 5
in Languages, Compilation Techniques and
Run Time Systems for Scalable Parallel Machines, Eds. Agrawal and Pande, Springer-Verlag, 1998.
1997
- J. Gu, Z. Li and G. Lee, Experience with efficient array data flow
analysis for array privatization, Proc. Sixth ACM SIGPLAN Symposium on Principles
and Practice of Parallel Programming, Las Vegas, Nevada, June 18-21, 1997,
pp. 157 -- 167.
- J. Huang, G. Jin and Z. Li, ``Page-mapping techniques for CC-NUMA
multiprocessors'',
Proc. Third IEEE International Conference on Algorithms and
Architectures for Parallel Processing , Melbourne, Australia.
Dec. 10-12, 1997, pp. 91 -- 104.
- J. Huang and Z. Li, ``Reducing cache misses for CC-NUMA by careful
page-mapping'',
Proc. Tenth International Conference on Parallel and Distributed
Computing Systems, New Orleans, Louisiana. Oct. 1-3, 1997,
International Society for Computers and their Applications,
pp. 417 -- 421.
1996
- Z. Li, J. Tsai, P.C-. Yew, X. Wang and B. Zheng, Compiler techniques for concurrent multithreading
with hardware speculation support, Ninth International Workshop on Languages
and Compilers for Parallel Computing, Santa Clara, CA, August 1996,
LNCS 1239, Springer-Verlag.
1995
- J. Gu, Z. Li and G. Lee, Symbolic array dataflow analysis
for array privatization and program parallelization, in Proc. Supercomputing '95 , Dec., 1995.
- J. Gu, Z. Li and T. N. Nguyen, An interprocedural parallelizing compiler and its support
for memory hierarchy research, Proc. Eighth International Workshop on Languages
and Compilers for Parallel Computing , Columbus, Ohio, August 1995,
LNCS 1033, Springer-Verlag.
- J. Willis, Z. Li and T.-P. Lin,
``Use of embedded scheduling to compile VHDL for effective
parallel simulation'', Proc. European Design Automation
Conference with EURO-VHDL, Sept. 1995, Brighton, UK.
1994
- F. Toussi-Mounes, D. Lilja, Z. Li, ``Compiler Support for
Reducing the Network Traffic and the Miss Ratio in
Directory-Based Cache Coherence Mechanisms'', Proc. 1994 ACM International Conference on Supercomputing , Manchester, UK,
August, 1994, ACM Press.
- Z. Li, T. N. Nguyen, ``An Empirical Study of Workload Distribution
under Static Scheduling'', in Proc. 1994 International Conference
on Parallel Processing , St. Charles, Illinois, August 1994, CRC Press, Inc.
- T. N. Nguyen, F. Mounes-Toussi, D. J. Lilja, Z. Li,
``A compiler-assisted scheme for adaptive cache coherence enforcement'', Proc. IFIP Conf. on Parallel Architectures and Compilation Techniques (PACT),
Montreal, Canada,
IFIP Transactions, North-Holland,
August, 1994.
- Z. Li, ``Software assistance for directory-based caches'', Proc. 8th IEEE International Parallel
Processing Symposium, 1994, IEEE Computer Society Press.
1993
- R. Eigenmann, J. Hoeflinger, G. Jaxon, Z. Li, and D. Padua,
``Restructuring Fortran programs for Cedar,'' Concurrency -- Experience and Practice, 5(7), 553-573,
Oct. 1993.
- T. Nguyen, Z. Li, D. Lilja,
``Efficient use of dynamically tagged cache directories through
compiler analysis,''
in Proceedings of
1993 International Conference on Parallel Processing , Aug. 1993, CRC Press, Inc.
- D. Kuck, E. Davidson, D. Lawrie, A. Sameh, C.-Q. Zhu, A. Veidenbaum,
J. Konicek, P. Yew, K. Gallivan, W. Jalby, H. Wijshoff, R. Bramley, U.M. Yang,
P. Emrath, D. Padua, R. Eigenmann, J. Hoeflinger, G. Jaxon, Z. Li, T. Murphy,
J. Andrews, and S. Turner,
``The Cedar system and an initial performance study,'' Proc. 20th International Symposium
on Computer Architecture , San Diego, CA, May 16-19, 1993, ACM Press.
1992
- Z. Li, Array privatization for parallel loop execution, Proc. Sixth ACM International Conf. on Supercomputing,
Washington, D.C., July, 1992, ACM Press.
- R. Eigenmann, J. Hoeflinger, Z. Li, and D. Padua, ``Experience in the automatic parallelization of four Perfect-benchmark
programs,'' Proc. Fifth International Workshop on Languages
and Compilers for Parallel Computing , August 1992,
Springer-Verlag.
1991
- R. Eigenmann, J. Hoeflinger, G. Jaxon, Z. Li, and D. Padua,
``Restructuring Fortran programs for Cedar'',
1991 International Conference on
Parallel Processing, Aug. 1991.
1990
- Z. Shen, Z. Li and P.-C. Yew,
``An Empirical Study on Program Characteristics for Parallelizing Compilers,'' IEEE Trans. on Parallel and Distributed Systems , 1(3),
pp. 356-364, July 1990.
- Z. Li, P.-C. Yew and C.-Q. Zhu,
``An efficient data dependence analysis for parallelizing compilers,'' IEEE Trans. on Parallel and Distributed Computing, 1(1),
pp. 26-34, Jan. 1990
1989 and before
- Z. Li and E. Reingold,
``Solution of a divide-and-conquer maximin recurrence,'' SIAM Journal on Computing , Vol. 18, No. 6, pp. 1180-1200, December 1989.
- Z. Li and W. Abu-Sufah,
``On reducing data synchronization in multiprocessed loops,'' IEEE Trans. on Computers , Vol. C-36, pp. 105-109, No. 1, Jan. 1987.
- Z. Li and P.-C. Yew,
``Program parallelization with interprocedural analysis,'' The Journal of Supercomputing , Kluwer Academic Publishers,
Vol. 2, No. 2, pp. 225-244, Oct. 1988.
- Z. Li and P.-C. Yew,
``Efficient interprocedural analysis for program parallelization
and restructuring,'' 1988 ACM/SIGPLAN Conference on Parallel Programming:
Experience with Applications, Languages and Systems, July 1988,
ACM Press. (This conference was the precursor of today's biennial
ACM/SIGPLAN Symp. on Principles and Practice of Parallel Programming
(PPoPP).)
- Z. Li and P.-C. Yew,
``Interprocedural analysis for parallel programs,'' Proc. 1988 International Conference on Parallel
Processing ,
vol.II, pp. 221-228, August, 1988, Penn-State Press.
- Z. Li and W. Abu-Sufah,
``A technique for reducing synchronization overhead in large scale
multiprocessors,'' Proc. 12th Annual International Symposium
on Computer Architecture, pp. 284-291, 1985, ACM Press.