|
J. Fritts, F. Steiling, and J. Tucek, “MediaBench II video: Expediting the Next Generation of Video Systems Research,” Microprocessors & Microsystems, 2005, vol. 33, no. 4, pp. 301-318
|
|
W. Gao, R. C. Zhao, L Han, “Research on SIMD Auto-vectorization Compiling Optimization,” Journal of Software, 2015, 26(6):1265?1284 (in Chinese)
|
|
T. Hiroaki, Y. Akeuchi, K. Sakanushi, et al. “Pack Instruction Generation for Media Processors Using Multi-valued Decision Diagram,” in Proceedings of the 4th International Conference on Hardware/Software Codesign and System Synthesis. ACM, 2006:154-159
|
|
S. Larsen, S. Amarasinghe, “Exploiting Superword level Parallelism with Multimedia Instruction Sets,” Acm Sigplan Notices, 2000, 35(5), 145-156
|
|
NAS parallel benchmark suite, Avaiklable at http://www.nas.nasa.gov/ Resources/Software/npb.html, Last accessed on June 16, 2014
|
|
M. Prieto, L. Pinuel, F. Catthoor, et al. “Improving Superword Level Parallelism Support in Modern Compilers,” IEEE/ACM/IFIP International Conference on Hardware/software Codesign and System Synthesis, CODES+ISSS 2005, Jersey City, Nj, Usa, September. 2005:303-308
|
|
L. N. Pouchet, “PolyBench: The polyhedral benchmark suite,” Available at http://www.cs.ucla.edu/?pouchet/software/
|
|
V. Porpodas and T. Jones, “Throttling Automatic Vectorization: When Less is More,” International Conference on Parallel Architecture & Compilation. IEEE, 2015:432-444
|
|
V. Porpodas, A. Magni, T. M. JONES, “PSLP: Padded SLP Automatic Vectorization,” Code Generation and Optimization (CGO), 2015 IEEE/ACM International Symposium on. IEEE, 2015:190-201
|
|
“Spec cpu2006,” Available at http://www.spec.org/cpu2006/, Last accessed on August 24, 2015
|
|
W. M. Joseph, “High Performance Compilers for Parallel Computing,” 1996
|
|
W. Y. Suo, R. C. Zhao, Y. Yao, “Superword Level Parallelism Instruction Analysis and Redundancy Optimization Algorithm on DSP,” Journal of Computer Applications, 2012, 32(12):3303-3307
|
|
Z. Gtoumavitis, B. WANG, “Towards a Holistic Approach to Auto-parallelization: Integrating Profile-driven Parallelism Detection and Machine-learning Based Mapping,” Acm Sigplan Notices, 2009, 44(6):177-187
|
|
S. Wei, R. C. Zhao, Y. Yao, “Loop-Nest Auto-Vectorization Based on SLP,” Journal of Software, 2012, 23(07):1717-1728
|
|
J. L. Xu, R. C. Zhao, L. Han, “Vector Exploring Path Optimization Algorithm of Superworld Level Parallelism with Subsection Constraints,” Journal of Computer Applications, 2015, 35(04):950-955
|