- Locality-Conscious Nested-Loops Parallelization
- Locality-Conscious Nested-Loops Parallelization
- ㆍ 저자명
- Parsa. Saeed,Hamzei. Mohammad
- ㆍ 간행물명
- ETRI journal
- ㆍ 권/호정보
- 2014년|36권 1호|pp.124-133 (10 pages)
- ㆍ 발행정보
- 한국전자통신연구원
- ㆍ 파일정보
- 정기간행물|ENG| PDF텍스트
- ㆍ 주제분야
- 기타
To speed up data-intensive programs, two complementary techniques, namely nested loops parallelization and data locality optimization, should be considered. Effective parallelization techniques distribute the computation and necessary data across different processors, whereas data locality places data on the same processor. Therefore, locality and parallelization may demand different loop transformations. As such, an integrated approach that combines these two can generate much better results than each individual approach. This paper proposes a unified approach that integrates these two techniques to obtain an appropriate loop transformation. Applying this transformation results in coarse grain parallelism through exploiting the largest possible groups of outer permutable loops in addition to data locality through dependence satisfaction at inner loops. These groups can be further tiled to improve data locality through exploiting data reuse in multiple dimensions.