Parallel Computing : on the Road to Exascale.
Material type:
- Content type: text
- Media type: computer
- Carrier type: online resource
- ISBN: 9781614996217
- DDC classification: 004.35
- LOC classification: QA76.58 .P373 2016
Title Page -- Preface -- Conference Organisation -- Contents -- Invited Talks -- Bio-Inspired Massively-Parallel Computation -- Automatic Tuning of Task Scheduling Policies on Multicore Architectures -- Architectures and Performance -- Algorithms -- Algorithmic Scheme for Hybrid Computing with CPU, Xeon-Phi/MIC and GPU Devices on a Single Machine -- A Many-Core Machine Model for Designing Algorithms with Minimum Parallelism Overheads -- Performance -- CPU Performance Analysis Using Score-P on PRIMEHPC FX100 Supercomputer -- Performance Improvements of Polydisperse DEM Simulations Using a Loose Octree Approach -- Execution Performance Analysis of the ABySS Genome Sequence Assembler Using Scalasca on the K Computer -- Performance Model Based on Memory Footprint for OpenMP Memory Bound Applications -- Evaluating OpenMP Performance on Thousands of Cores on the Numascale Architecture -- Acceleration of Large Scale OpenFOAM Simulations on Distributed Systems with Multicore CPUs and GPUs -- Optimized Variant-Selection Code Generation for Loops on Heterogeneous Multicore Systems -- MPI Communication on MPPA Many-Core NoC: Design, Modeling and Performance Issues -- Drivers for Device to Device Streaming -- Programming Models and Methods -- Skeletons -- Portable Parallelization of the EDGE CFD Application for GPU-Based Systems Using the SkePU Skeleton Programming Library -- Structured Parallel Implementation of Tree Echo State Network Model Selection -- Java Implementation of Data Parallel Skeletons on GPUs -- Data Parallel Patterns in Erlang/OpenCL -- Hybrid Coarrays: A PGAS Feature for Many-Core Architectures -- Lapedo: Hybrid Skeletons for Programming Heterogeneous Multicore Machines in Erlang -- Accelerators -- Evaluation of 3-D Stencil Codes on the Intel Xeon Phi Coprocessor -- Hierarchical Parallelism in a Physical Modelling Synthesis Code.
Harnessing CUDA Dynamic Parallelism for the Solution of Sparse Linear Systems -- Model-Driven Development of GPU Applications -- Exploring the Offload Execution Model in the Intel Xeon Phi via Matrix Inversion -- Programming GPUs with C++14 and Just-In-Time Compilation -- Resource Management -- Active Packet Pacing as a Congestion Avoidance Technique in Interconnection Network -- Hybrid Parallelization of Hyper-Dimensional Vlasov Code with OpenMP Loop Collapse Directive -- Active Resource Management for Multi-Core Runtime Systems Serving Malleable Applications -- Improving Energy-Efficiency of Static Schedules by Core Consolidation and Switching Off Unused Cores -- Efficient Parallel Linked List Processing -- Streams -- Streams as an Alternative to Halo Exchange -- An Embedded C++ Domain-Specific Language for Stream Parallelism -- Pipeline Template for Streaming Applications on Heterogeneous Chips -- Applications -- Numerical Methods -- Efficient and Scalable Distributed-Memory Hierarchization Algorithms for the Sparse Grid Combination Technique -- Adapting a Finite-Element Type Solver for Bioelectromagnetics to the DEEP-ER Platform -- High Performance Eigenvalue Solver in Exact-Diagonalization Method for Hubbard Model on CUDA GPU -- A General Tridiagonal Solver for Coprocessors: Adapting g-Spike for the Intel Xeon Phi -- CAHTR: Communication-Avoiding Householder TRidiagonalization -- Simulation of External Aerodynamics of the DrivAer Model with the LBM on GPGPUs -- Algorithms -- A Parallel Algorithm for Decomposition of Finite Languages -- Exploiting the Space Filling Curve Ordering of Particles in the Neighbour Search of Gadget3 -- On-the-Fly Memory Compression for Multibody Algorithms -- Flexible and Generic Workflow Management -- A Massively Parallel Barnes-Hut Tree Code with Dual Tree Traversal -- Flow Problems.
Performance Modeling of a Compressible Hydrodynamics Solver on Multicore CPUs -- Developing a Scalable and Flexible High-Resolution DNS Code for Two-Phase Flows -- FPGA Port of a Large Scientific Model from Legacy Code: The Emanuel Convection Scheme -- Data Science -- How to Keep a Geographic Map Up-To-Date -- Static and Dynamic Big Data Partitioning on Apache Spark -- Mini-Symposium: ParaFPGA-2015: Parallel Computing with FPGAs -- ParaFPGA15: Exploring Threads and Trends in Programmable Hardware -- FPGAs as Components in Heterogeneous High-Performance Computing Systems: Raising the Abstraction Level -- FPGA Acceleration of SAT Preprocessor -- Leveraging FPGA Clusters for SAT Computations -- High-Speed Calculation of Convex Hull in 2D Images Using FPGA -- Workload Distribution and Balancing in FPGAs and CPUs with OpenCL and TBB -- A Run-Time System for Partially Reconfigurable FPGAs: The Case of STMicroelectronics SPEAr Board -- Exploring Automatically Generated Platforms in High Performance FPGAs -- Mini-Symposium: Experiences of Porting and Optimising Code for Xeon Phi Processors -- Symposium on Experiences of Porting and Optimising Code for Xeon Phi Processors -- Experiences Porting Production Codes to Xeon Phi Processors -- Preparing a Seismic Imaging Code for the Intel Knights Landing Xeon Phi Processor -- LU Factorisation on Xeon and Xeon Phi Processors -- Mini-Symposium: Coordination Programming -- Mini-Symposium on Coordination Programming - Preface -- Claud: Coordination, Locality and Universal Distribution -- Coordination with Structured Composition for Cyber-Physical Systems -- Mini-Symposium: Symposium on Parallel Solvers for Very Large PDE Based Systems in the Earth- and Atmospheric Sciences -- On Efficient Time Stepping Using the Discontinuous Galerkin Method for Numerical Weather Prediction.
Porting the COSMO Dynamical Core to Heterogeneous Platforms Using STELLA Library -- Towards Compiler-Agnostic Performance in Finite-Difference Codes -- Mini-Symposium: Is the Programming Environment Ready for Hybrid Supercomputers? -- Is the Programming Environment Ready for Hybrid Supercomputers? -- Utilizing Hybrid Programming Environments: CSCS Case Studies -- SYCL: Single-Source C++ Accelerator Programming -- Using Task-Based Parallelism Directly on the GPU for Automated Asynchronous Data Transfer -- A Strategy for Developing a Performance Portable Highly Scalable Application -- Mini-Symposium: Symposium on Energy and Resilience in Parallel Programming -- Mini-Symposium on Energy and Resilience in Parallel Programming -- Performance and Fault Tolerance of Preconditioned Iterative Solvers on Low-Power ARM Architectures -- Compiling for Resilience: The Performance Gap -- Automation of Significance Analyses with Interval Splitting -- Energy Minimization on Heterogeneous Systems Through Approximate Computing -- Landing Containment Domains on SWARM: Toward a Robust Resiliency Solution on a Dynamic Adaptive Runtime Machine -- Mini-Symposium: Symposium on Multi-System Application Extreme-Scaling Imperative -- MAXI - Multi-System Application Extreme-Scaling Imperative -- High Throughput Simulations of Two-Phase Flows on Blue Gene/Q -- Direct Numerical Simulation of Fluid Turbulence at Extreme Scale with psOpen -- Simulating Morphologically Detailed Neuronal Networks at Extreme Scale -- FE2TI: Computational Scale Bridging for Dual-Phase Steels -- Performance Evaluation of the LBM Solver Musubi on Various HPC Architectures -- Extreme-Scaling Applications 24/7 on JUQUEEN Blue Gene/Q -- Extreme Scale-Out SuperMUC Phase 2 - Lessons Learned -- "K-scale" Applications on the K Computer and Co-Design Effort for the Development of "post-K" -- Author Index.
Description based on publisher supplied metadata and other sources.
Electronic reproduction. Ann Arbor, Michigan : ProQuest Ebook Central, 2024. Available via World Wide Web. Access may be limited to ProQuest Ebook Central affiliated libraries.