Breaking News

be quiet! enters high-end gaming mouse market with Dark Perk Ergo and Dark Perk Sym ASUS ROG announces ROG Strix GS-BE7200 Dual-Band WiFi 7 Gaming Router Transcend Launches RDE3 microSD Express Card Reader for Next-Generation High-Speed Performance Akasa Unleashes Six New Low-Profile CPU Coolers Up to 165W TDP Cooling in Compact Form Factors SWIT announces Powercell Battery Series for Sony, Canon, Nikon, and Fujifilm Cameras

logo

  • Share Us
    • Facebook
    • Twitter
  • Home
  • Home
  • News
  • Reviews
  • Essays
  • Forum
  • Legacy
  • About
    • Submit News

    • Contact Us
    • Privacy

    • Promotion
    • Advertise

    • RSS Feed
    • Site Map

Search form

Nvidia Released  CUDA 4.0

Nvidia Released CUDA 4.0

GPUs Mar 1,2011 0

NVIDIA today announced the latest version of the NVIDIA CUDA Toolkit for developing parallel applications using NVIDIA GPUs. Nvidia said that the new NVIDIA CUDA 4.0 Toolkit was designed to make parallel programming easier, and enable more developers to port their applications to GPUs.

These are the three main features of the new toolkit:

- NVIDIA GPUDirect 2.0 Technology -- Offers support for peer-to-peer communication among GPUs within a single server or workstation. This enables faster multi-GPU programming and application performance.
- Unified Virtual Addressing (UVA) -- Provides a single merged-memory address space for the main system memory and the GPU memories, enabling quicker and easier parallel programming.
- Thrust C++ Template Performance Primitives Libraries -- Provides a collection of open source C++ parallel algorithms and data structures that ease programming for C++ developers. With Thrust, routines such as parallel sorting are 5X to 100X faster than with Standard Template Library (STL) and Threading Building Blocks (TBB).

The CUDA 4.0 architecture release also includes a number of other key features and capabilities, including:

- MPI Integration with CUDA Applications -- Modified MPI implementations automatically move data from and to the GPU memory over Infiniband when an application does an MPI send or receive call.
- Multi-thread Sharing of GPUs -- Multiple CPU host threads can share contexts on a single GPU, making it easier to share a single GPU by multi-threaded applications.
- Multi-GPU Sharing by Single CPU Thread -- A single CPU host thread can access all GPUs in a system. Developers can easily coordinate work across multiple GPUs for tasks such as "halo" exchange in applications.
- New NPP Image and Computer Vision Library -- A set of image transformation operations that enable rapid development of imaging and computer vision applications.

New Capabilities

- Auto performance analysis in the Visual Profiler
- New features in cuda-gdb and added support for MacOS
- Added support for C++ features like new/delete and virtual functions
- New GPU binary disassembler

A release candidate of CUDA Toolkit 4.0 will be available free of charge beginning March 4, 2011, by enrolling in the CUDA Registered Developer Program at: www.nvidia.com/paralleldeveloper.

Tags: Nvidiacuda
Previous Post
Panasonic Drops Game Console Project: report
Next Post
Intel Launches Graphics Performance Analyzers 4.0

Related Posts

  • NVIDIA introduces DLSS 4.5, Path Tracing and G-SYNC Pulsar Supercharge Gameplay

  • Intel and NVIDIA to Jointly Develop AI Infrastructure and Personal Computing Products

  • MSI Expands NVIDIA RTX PRO Server Lineup

  • PNY Announces Support for the New NVIDIA RTX PRO Blackwell Graphics Card Family

  • KIOXIA flash memory and SSD solutions empower AI applications at NVIDIA GTC 2025

  • INNO3D GEFORCE RTX 50 SERIES IS HERE!

  • VENGEANCE Gaming PCs are Ready for NVIDIA GeForce RTX 50 Series GPUs

  • GeForce At Computex 2024

Latest News

be quiet! enters high-end gaming mouse market with Dark Perk Ergo and Dark Perk Sym
Gaming

be quiet! enters high-end gaming mouse market with Dark Perk Ergo and Dark Perk Sym

ASUS ROG announces ROG Strix GS-BE7200 Dual-Band WiFi 7 Gaming Router
Enterprise & IT

ASUS ROG announces ROG Strix GS-BE7200 Dual-Band WiFi 7 Gaming Router

Transcend Launches RDE3 microSD Express Card Reader for Next-Generation High-Speed Performance
Cameras

Transcend Launches RDE3 microSD Express Card Reader for Next-Generation High-Speed Performance

Akasa Unleashes Six New Low-Profile CPU Coolers Up to 165W TDP Cooling in Compact Form Factors
Cooling Systems

Akasa Unleashes Six New Low-Profile CPU Coolers Up to 165W TDP Cooling in Compact Form Factors

SWIT announces Powercell Battery Series for Sony, Canon, Nikon, and Fujifilm Cameras
Cameras

SWIT announces Powercell Battery Series for Sony, Canon, Nikon, and Fujifilm Cameras

Popular Reviews

be quiet! Dark Mount Keyboard

be quiet! Dark Mount Keyboard

Terramaster F8-SSD

Terramaster F8-SSD

be quiet! Light Mount Keyboard

be quiet! Light Mount Keyboard

Soundpeats Pop Clip

Soundpeats Pop Clip

Akaso 360 Action camera

Akaso 360 Action camera

Dragon Touch Digital Calendar

Dragon Touch Digital Calendar

be quiet! Pure Loop 3 280mm

be quiet! Pure Loop 3 280mm

Noctua NF-A12x25 G2 fans

Noctua NF-A12x25 G2 fans

Main menu

  • Home
  • News
  • Reviews
  • Essays
  • Forum
  • Legacy
  • About
    • Submit News

    • Contact Us
    • Privacy

    • Promotion
    • Advertise

    • RSS Feed
    • Site Map
  • About
  • Privacy
  • Contact Us
  • Promotional Opportunities @ CdrInfo.com
  • Advertise on out site
  • Submit your News to our site
  • RSS Feed