News | Microsoft Unveils ND H100 v5 Virtual Machines Offering Nvidia’s Most Robust GPUs

Microsoft Unveils ND H100 v5 Virtual Machines Offering Nvidia’s Most Robust GPUs

Published by: Insights Desk Released: Aug 08, 2023 Source: DemandTalk

Highlights:

Microsoft’s ND H100 v5 VMs appear to be among its most potent computational offerings to date.
Enterprises can register their interest in the new VMs, and the company promises to expand the offering to make hundreds of thousands of H100 GPUs available to customers by next year.

Microsoft Corp. plans to encourage organizations to deploy their most advanced artificial intelligence projects to the Azure cloud platform by providing access to Nvidia Corp.’s latest and most powerful graphics processing units.

The company has announced that its ND H100 v5 Virtual Machine series is now generally accessible via the Azure cloud. It gives customers access to the high-performance computing infrastructure needed to train and run generative AI models.

Furthermore, it is expanding the availability of its Azure OpenAI Service, which allows clients to study the most advanced AI models developed by ChatGPT maker OpenAI LP. According to the company, Azure OpenAI is available worldwide in different regions.

Microsoft’s ND H100 v5 VMs appear to be among its most potent computational offerings to date. They’re already available in Azure’s East and West US regions, and they’re outfitted with eight of Nvidia’s most powerful H100 GPUs.

Nvidia introduced the H100 GPUs last year, stating that they are built on the company’s revolutionary Hopper architecture. They can provide orders of magnitude more processing power than the Nvidia A100 GPUs utilized to train the initial ChatGPT.

The benefit of the H100 GPUs is that they offer “significantly faster AI model performance” than the earlier generation of GPUs, Nvidia stated at the time of release. The ND H100 v5 also exhibits Intel Corp’s modern fourth Gen Intel Xeon Scalable CPUs featuring low-latent networking through Nvidia’s Quantum-2 CX7 InfiniBand technology. They also integrate DDR5 memory that facilitates quicker data transfer speeds to tackle the largest AI training datasets and PCIe Gen5, which offers 64 GB/sec bandwidth per GPU.

Microsoft makes some bold promises regarding the ND H100 v5 instances’ performance, including a six-fold increase in matrix multiplication operations and 1.8 times faster inference on large language models like OpenAI’s GPT-BLOOM 175B.

Organizations can express their interest and register for the latest VMs, and the company assures to extend the offering to create a large number of H100 GPUs for customers by next year.

Because the ND H100 v5 VMs are designed for generative AI workloads, Microsoft is also expanding the Azure OpenAI Service, which enables direct access to OpenAI’s cutting-edge AI models, GPT-4, and GPT-35-Turbo. The Azure OpenAI service, which was launched in January, was initially available only in Azure’s East United States, France Central, South Central United States, and West Europe regions but has now been expanded to the East United States 2, UK South Canada East, and Japan East, as per the company.

Holger Mueller of Constellation Research Inc. mentioned this latest development shows how keen Microsoft has been to bolster its AI solutions and make them robust and extensively available. “It wants to make OpenAI available to more customers and it’s offering Nvidia-based virtual machines alongside it, for customers to run and monetize their custom AI models. The speed of the rollout, along with availability and stability, is key for customer adoption, as many enterprises are tied to specific regions due to data residency and security requirements,” he added.

According to Microsoft, the Azure OpenAI Service is already utilized by over 11,000 commercial customers and is growing at a rate of roughly 100 new users daily.

Nidhi Chappell, General Manager of Azure HPC, AI, SAP, and Confidential Computing at Microsoft, said, “As part of this expansion, we are increasing the availability of GPT-4, our most advanced generative AI model, across the new regions. This enhancement allows more customers to leverage GPT-4’s capabilities for content generation, document intelligence, customer service and beyond.”

single-vendor sase for dummies...

critical guidance for evaluating sase solutions...

choosing the best sase solution for your hybrid wo...

giving data centers a clear roadmap to it sustaina...

protect data across hybrid cloud environments with...

oracle cloud for cios: lead your ai vision to real...

10 cloud trends cios must track in 2024...

the oracle playbook for it systems excellence...

product guide: automated reconciliation solution...

how to streamline cloud security and embrace sase...

get to know the content cloud for federal governme...

streamline multi-cloud networking: leverage equini...

the power to adapt workday quick demo...

unify your communications...

cloud phone system buyers...

cloud phone system buyers...

2023 gartner magic quadrant for cloud erp for serv...

how to secure your content in the cloud with box f...

business value of dell vxrail hci...

migrate from centos linux to a cloud-ready operati...

leveraging multi-tenant architecture for scalabili...

how cost-effective cloud accounting software strea...

cloud managed services selection criteria explored...

msp success with remote monitoring and management ...

cloud computing data security: 10 best practices f...

the versatility of cloud pbx phone systems...

unleashing the power of backup as a service (baas)...

private cloud computing – a green flag for all b...

cloud workload protection platforms (cwpp) unveile...

dynamics of end-user computing unveiled...

cloud application security strengthening the digit...

understanding cloud concentration risks in modern ...

cloud automation alleviating hurdles in cloud mana...

immutable infrastructure's impact: benefits, ci/cd...

sovereign cloud: a necessity or trendy choice?...

boosting your business with network-attached stora...

decoding the essentials and future outlook of dist...

network attached storage – the building blocks f...

core elements in distributed file system architect...

finops cloud – evolving with the framework...

qa wolf secures usd 36 m to enhance app testing...

alphabet call offs hubspot acquisition plans...

aws introduces graviton4, fourth generation custom...

coder raises usd 35 m to expand worldwide and enha...

rocketlane raises usd 24 m to expand automation d...

nvidia reportedly acquires shoreline for usd 100 m...

suse acquires stackstate, a kubernetes observabili...

restate secures usd 7 m for development of fault-t...

sap acquires walkme, a saas platform developer for...

five9-salesforce ai partnership elevates cx at con...

transcend secures usd 40 m to help enterprises pro...

the bridgepoint lumapps deal closes at usd 650m...

vercel raises usd 250 m at usd 3.25 b valuation...

the permira squarespace acquisition agreement unvi...

ibm announces acquisition of hashicorp inc. for us...

the potential hashicorp acquisition by ibm could b...

salesforce will not acquire informatica, a data ma...

hr software maker rippling people center funding a...

google llc’s axion cpu unit debuted in las vegas...

pigment sas secures usd 145 m for innovation in bu...

Microsoft Unveils ND H100 v5 Virtual Machines Offering Nvidia’s Most Robust GPUs

Highlights:

Insights Desk

Related posts

QA Wolf Secures USD 36 M to Enhance App Testing...

Alphabet Call Offs HubSpot Acquisition Plans...

AWS Introduces Graviton4, Fourth Generation Custom...

Coder Raises USD 35 M to Expand Worldwide and Enha...

Rocketlane Raises USD 24 M to Expand Automation D...

Nvidia Reportedly Acquires Shoreline for USD 100 M...

SUSE Acquires StackState, a Kubernetes Observabili...

Restate Secures USD 7 M for Development of Fault-t...

SAP Acquires WalkME, a SaaS Platform Developer for...

Five9-Salesforce AI Partnership Elevates CX at Con...

Our Brands