skip to main content
Department of Computer Science University of Colorado Boulder
cu: home | engineering | mycuinfo | about | cu a-z | search cu | contact cu cs: about | calendar | directory | catalog | schedules | mobile | contact cs
home · events · colloquia · 1999-2000 · 

Colloquium - Speight

ECCR 265

Efficient Parallel Computing on Clusters of Multiprocessors
Rice University

The face of parallel computing has changed significantly over the last decade. Thanks to the availability of low cost, high-performance workstations and user-level networks, clusters of small-scale symmetric multiprocessors have emerged as a viable alternative to previous, expensive monolithic systems. These clusters provide an excellent price/performance operating point for a variety of small to medium scale parallel programming needs, but providing a cohesive programming environment in such a context remains a challenging task.

This talk will describe the Brazos Parallel Programming Environment. Brazos has been under development for five years, and addresses many of the problems associated with utilizing clusters as a single parallel computer. In particular, work to date allows Brazos to provide both a shared memory programming interface through the use of ANL macros, and a message-passing interface through an implementation of the MPI library. Furthermore, Brazos programmers have the option of using both programming styles in the same application. Brazos achieves superior performance on shared memory applications through the selective use of multicast communication, adaptive runtime performance tuning, and a software adaptation of scope consistency. Brazos has also been adapted to use the Virtual Interface (VI) Architecture, a proposed industry-standard low-latency, user-level network adopted by Microsoft, Intel, and Compaq. The performance gained by tailoring Brazos to make use of the specific features of the VI Architecture will be discussed.

Current work on Brazos seeks to address two problems associated with using clusters to solve large-scale parallel problems: cluster reliability and cluster resource sharing. Brazos utilizes user-level thread migration combined with a checkpoint/recovery mechanism to provide a reliable system that tolerates single or multiple node failure without necessitating a restart of the parallel applications currently running. Additionally, Brazos provides a multiprogrammed parallel environment, allowing the runtime system to take advantage of processor and network under-utilization without the expense of executing multiple instances of the runtime support system on each node. The mechanisms and performance of this ongoing work will be presented.

Refreshments will be served in ECOT 831 immediately following the talk.
Hosted by Gary Nutt.

The Department holds colloquia throughout the Fall and Spring semesters. These colloquia, open to the public, are typically held on Thursday afternoons, but sometimes occur at other times as well. If you would like to receive email notification of upcoming colloquia, subscribe to our Colloquia Mailing List. If you would like to schedule a colloquium, see Colloquium Scheduling.

Sign language interpreters are available upon request. Please contact Stephanie Morris at least five days prior to the colloquium.

See also:
Department of Computer Science
College of Engineering and Applied Science
University of Colorado Boulder
Boulder, CO 80309-0430 USA
Send email to

Engineering Center Office Tower
ECOT 717
FAX +1-303-492-2844
XHTML 1.0/CSS2 ©2012 Regents of the University of Colorado
Privacy · Legal · Trademarks
May 5, 2012 (13:29)