Press Release Summary:
PGIÂ® Release 8.0 supports OpenMP 3.0 multicore parallel programming standard in Fortran and C across Linux/Mac OS X/Windows. It enables automatic offloading of parallel computations from x64 host processors to CUDA-enabled GPUs and analyzes source code automatically, listing possible or inhibited performance changes. Data is presented through Common Compiler Feedback Format. Program supports Microsoft HPC Server 2008 clusters and OpenMP parallel/local OpenMPI parallel debugging in Mac OS X.
Original Press Release:
The Portland Group Announces PGI 8.0 Optimizing Compilers and Tools for Multi-Core x64 Processors
Latest PGI release supports OpenMP 3.0 on Linux, Windows and MacOS; includes heterogeneous x64+GPU compiler technology preview
Portland, Ore, November 17, 2008 - The Portland Group®, a wholly-owned subsidiary of STMicroelectronics, today announced the general availability of the PGI® Release 8.0 line of high-performance compilers and development tools for Linux, Mac OS X and Windows. PGI Release 8.0 includes full support for the recently announced OpenMP 3.0 multi-core parallel programming standard in Fortran and C across all supported platforms. The new release also adds support for building and debugging of OpenMPI applications on both Linux and MacOS, complementing existing MPI capabilities on Linux and Windows clusters. PGI 8.0 users can now develop and deploy multi-core and parallel applications on any of the major desktop or cluster operating systems using identical PGI compilers, the latest OpenMP features, MPI implementation of choice and bundled OpenMP/MPI-capable debugging and profiling tools. In a significant new development, PGI Release 8.0 also marks The Portland Group's entry into the field of accelerated computing with provisional support for automatic offloading of parallel computations from x64 host processors to CUDA-enabled GPUs from NVIDIA.
"Together with PGI Unified Binary technology, which enables developers to leverage the latest CPU innovations from both AMD and Intel while treating x64 processors as a single platform, the new features in PGI 8.0 maximize flexibility and independence for HPC users and large multi-platform supercomputing centers," said Douglas Miles, director, The Portland Group.
New Performance Analysis Tools
In addition to building on a compiler and tools product line that now includes all best practices HPC and multi-core programming technologies, the PGI 8.0 compilers include an all-new capability to automatically analyze source code, produce an extensive database describing performance optimizations that are possible or inhibited, and provide advice for modifying the source code to take advantage of the possible optimizations. With Release 8.0, PGI has standardized the organization and interface to this data through the Common Compiler Feedback Format (CCFF). PGI is publishing the CCFF standard and making access to it freely available in an effort to improve the utility and interoperability of PGI, third-party and research-community software tuning tools. More information on CCFF is available from the PGI website at www.pgroup.com/ccff.
PGI's PGPROF®8.0 performance profiler displays CCFF data coupled with user source code in a logical, compact and intuitive graphical user interface (GUI). A command-line interface is also supported. Programmers can quickly and easily identify code segments that are already well-structured, as well as those that can be restructured to improve performance. In addition to identifying sections of an application that consume most of the compute time or system resources, PGPROF provides developers with specific actionable performance optimization feedback about their source code. The data, presented on a per-thread and/or per-process basis, simplifies performance tuning by identifying:
o Streaming SIMD Extensions (SSE) vector loops, and why vectorization is inhibited on non-vector loops
o Loops auto-parallelized for multi-core, and why parallelization is inhibited on serial loops
o Loops that are candidates for OpenMP parallelization
o Compute intensity of loops, and candidates for offloading to a GPU (Graphics Processing Unit) or accelerator
o Loops with very small or very large iteration counts, and how they can be modified to maximize performance for SSE and the cache-based memory hierarchy
o Data prefetching, and opportunities for prefetch tuning using directives and pragmas
In addition to these detailed analyses, PGPROF also includes overall program level analyses including information about in-lined functions and subroutines and information about how each file was compiled, comprehensive system configuration information and many other performance-critical characteristics of Fortran, C and C++ source code. Unlike traditional performance tuning tools which only report on and help tune performance for a specific type of processor or system, or focus solely on parallelization, the PGI 8.0 compilers and tools provide developers with feedback and insight on how to restructure loops and algorithms to enhance performance on any modern multi-core x64 CPU or GPU accelerator.
"Parallelism does not equate to performance," said Michael Wolfe, compiler engineer, The Portland Group. "The focus needs to be not on parallelism, but on performance, where parallelism is one of the tools to get it."
Provisional GPU Support
PGI Release 8.0 also includes a technology preview of the industry's first Fortran and C compilers that automatically offload computations from an x64 host program to a GPU. Until now, C and C++ developers targeting GPU accelerators have had to rely on language extensions to their programs. Use of GPUs from Fortran applications has been extremely limited. x64+GPU programmers have been required to program at a detailed level including a need to understand and specify data usage information and manually construct sequences of calls to manage all movement of data between the x64 host and GPU. Using the provisional support in PGI Release 8.0, programmers can accelerate Linux applications on x64+GPU platforms by adding OpenMP-like compiler directives to existing high-level standard-compliant Fortran and C programs and then recompiling with appropriate compiler options.
"PGI is joining the increasing number of software publishers offering innovative approaches to harnessing the power of NVIDIA GPUs by leveraging the CUDA development environment," said Andy Keane, general manager, Tesla computing solutions, NVIDIA. "With their 20 year history and track record of success, we expect PGI's offering will open the door for members of the HPC community to begin incrementally porting large legacy production science and engineering codes to take full advantage of NVIDIA Tesla accelerators."
The PGI 8.0 x64+GPU compilers automatically analyze whole program structure and data, split portions of the application between the x64 CPU and GPU as specified by user directives, and define and generate an optimized mapping of loops to automatically use the parallel cores, hardware threading capabilities and SIMD vector capabilities of modern GPUs. In addition to directives and pragmas that specify regions of code or functions to be accelerated, the PGI Fortran and C compilers support user directives that give the programmer fine-grained control over the mapping of loops, allocation of memory, and optimization for the GPU memory hierarchy. The PGI compilers generate unified x64+GPU object files and executables that manage all movement of data to/from the GPU device while leveraging all existing host-side utilities - linker, librarians, makefiles - and require no changes to the existing standard HPC Linux/x64 programming environment.
Another significant new feature included in PGI Release 8.0 is support for OpenMP parallel and local OpenMPI parallel debugging in Mac OS X, new simplified licensing setup on Microsoft Windows, support for Microsoft HPC Server 2008 clusters and support for the latest processors from AMD and Intel.
Evaluation copies of the new PGI compilers are available from The Portland Group web site at www.pgroup.com. Registration is required.
About The Portland Group
The Portland Group, a wholly-owned subsidiary of STMicroelectronics, is the premier supplier of high-performance Fortran, C, and C++ compilers and tools for high-end computing systems and x64 and x86 processor-based workstations, servers, and clusters. PGI® products are used widely by engineers, researchers and scientists in high-performance computing (HPC), the field of technical computing engaged in the modeling and simulation of complex processes, such as ocean modeling, weather forecasting, seismic analysis, bioinformatics and other areas. PGI compilers, which convert software programs into the binary instructions that computers understand, are recognized in the HPC community for delivering world-class performance across a wide spectrum of applications and benchmarks, and they are referenced regularly as the industry standard for performance and reliability.
Further information on The Portland Group products can be found at www.pgroup.com , by calling Sales at (503) 682-2806, or by email to firstname.lastname@example.org.
STMicroelectronics is a global leader in developing and delivering semiconductor solutions across the spectrum of microelectronics applications. An unrivalled combination of silicon and system expertise, manufacturing strength, Intellectual Property (IP) portfolio and strategic partners positions the Company at the forefront of System-on-Chip (SoC) technology and its products play a key role in enabling today's convergence markets. The Company's shares are traded on the New York Stock Exchange, on Euronext Paris and on the Milan Stock Exchange. In 2007, the Company's net revenues were $10.0 billion. Further information on ST can be found at www.st.com.
All trademarks, trade names, service marks, and logos referenced herein belong to their respective companies.
+1 212 821-8959
US Media Relations
603 465 9213