Generative AI, Hardware and Machine Learning

Supercharge your auto scaling for generative AI inference – Introducing Container Caching in SageMaker Inference

AWS Machine Learning - AI

DECEMBER 2, 2024

Today at AWS re:Invent 2024, we are excited to announce the new Container Caching capability in Amazon SageMaker, which significantly reduces the time required to scale generative AI models for inference. In our tests, we’ve seen substantial improvements in scaling times for generative AI model endpoints across various frameworks.

Generative AI

Generative AI Artificial Inteligence Machine Learning AWS

OctoML raises $85M for it for its machine learning acceleration platform

TechCrunch

NOVEMBER 1, 2021

OctoML , a Seattle-based startup that helps enterprises optimize and deploy their machine learning models, today announced that it has raised an $85 million Series C round led by Tiger Global Management. “If you make something twice as fast on the same hardware, making use of half the energy, that has an impact at scale.”

Artificial Inteligence

Artificial Inteligence Machine Learning Hardware Energy

Dulling the impact of AI-fueled cyber threats with AI

CIO

OCTOBER 24, 2024

IT leaders are placing faith in AI. Consider 76 percent of IT leaders believe that generative AI (GenAI) will significantly impact their organizations, with 76 percent increasing their budgets to pursue AI. But when it comes to cybersecurity, AI has become a double-edged sword.

Artificial Inteligence

Artificial Inteligence Artificial Intelligence Generative AI Training

Webinars

Agent Tooling: Connecting AI to Your Tools, Systems & Data

Automation, Evolved: Your New Playbook for Smarter Knowledge Work

Mastering Apache Airflow® 3.0: What’s New (and What’s Next) for Data Orchestration

MORE WEBINARS

Stability AI backs effort to bring machine learning to biomed

TechCrunch

NOVEMBER 4, 2022

Stability AI , the venture-backed startup behind the text-to-image AI system Stable Diffusion, is funding a wide-ranging effort to apply AI to the frontiers of biotech. Stability AI’s ethically questionable decisions to date aside, machine learning in medicine is a minefield. Looking ahead.

Artificial Inteligence

Artificial Inteligence Machine Learning Biotech Training

Reduce ML training costs with Amazon SageMaker HyperPod

AWS Machine Learning - AI

APRIL 10, 2025

As cluster sizes grow, the likelihood of failure increases due to the number of hardware components involved. Each hardware failure can result in wasted GPU hours and requires valuable engineering time to identify and resolve the issue, making the system prone to downtime that can disrupt progress and delay completion.

Training

Training Artificial Inteligence Hardware Systems Review

Gartner projects major IT spending increases for 2025

CIO

OCTOBER 24, 2024

growth this year, with data center spending increasing by nearly 35% in 2024 in anticipation of generative AI infrastructure needs. This spending on AI infrastructure may be confusing to investors, who won’t see a direct line to increased sales because much of the hyperscaler AI investment will focus on internal uses, he says.

Data Center

Data Center Artificial Inteligence Generative AI Artificial Intelligence

9 IT skills where expertise pays the most

CIO

APRIL 25, 2025

As one of the most sought-after skills on the market right now, organizations everywhere are eager to embrace AI as a business tool. AI skills broadly include programming languages, database modeling, data analysis and visualization, machine learning (ML), statistics, natural language processing (NLP), generative AI, and AI ethics.

Artificial Inteligence

Artificial Inteligence DevOps Virtualization Industry

Together raises $20M to build open source generative AI models

TechCrunch

MAY 15, 2023

Generative AI — AI that can write essays, create artwork and music, and more — continues to attract outsize investor attention. According to one source, generative AI startups raised $1.7 billion in Q1 2023, with an additional $10.68 billion worth of deals announced in the quarter but not yet completed.

Open Source

Open Source Generative AI ChatGPT Hardware

Timekettle’s $699 translation hardware handles multiple languages at once

TechCrunch

JANUARY 8, 2024

Improvements to processing power, machine learning and cloud platforms have all played key roles in this development. The technology is increasingly becoming a mainstay of wireless earbuds, and the recent explosion of generative AI platforms will only serve to further these impressive results.

Hardware

Hardware Wireless Generative AI Artificial Inteligence

Preparing the foundations for Generative AI

CIO

FEBRUARY 20, 2024

Governments and public services agencies are keen to push forwards with generative AI. Yet making this shift isn’t simply a matter of adopting generative AI tools and hoping this alone will drive success. Data also needs to be sorted, annotated and labelled in order to meet the requirements of generative AI.

Generative AI

Generative AI Government Infrastructure Cloud

AI agents loom large as organizations pursue generative AI value

CIO

AUGUST 19, 2024

Yet as organizations figure out how generative AI fits into their plans, IT leaders would do well to pay close attention to one emerging category: multiagent systems. Agents come in many forms, many of which respond to prompts humans issue through text or speech.

Generative AI

Generative AI Artificial Inteligence Organization Technical Review

Deploy DeepSeek-R1 Distilled Llama models in Amazon Bedrock

AWS Machine Learning - AI

JANUARY 29, 2025

Open foundation models (FMs) have become a cornerstone of generative AI innovation, enabling organizations to build and customize AI applications while maintaining control over their costs and deployment strategies. You can access your imported custom models on-demand and without the need to manage underlying infrastructure.

Generative AI

Generative AI Artificial Inteligence AWS Technical Review

Understanding prompt engineering: Unlock the creative potential of Stability AI models on AWS

AWS Machine Learning - AI

NOVEMBER 14, 2024

In the rapidly evolving world of generative AI image modeling, prompt engineering has become a crucial skill for developers, designers, and content creators. Understanding the Prompt Structure Prompt engineering is a valuable technique for effectively using generative AI image models. A photo of a (red:1.2)

Engineering

Engineering AWS 3D Generative AI

A secure approach to generative AI with AWS

AWS Machine Learning - AI

APRIL 16, 2024

Generative artificial intelligence (AI) is transforming the customer experience in industries across the globe. The biggest concern we hear from customers as they explore the advantages of generative AI is how to protect their highly sensitive data and investments.

Generative AI

Generative AI AWS Artificial Inteligence Infrastructure

CoreWeave, a GPU-focused cloud compute provider, lands $221M investment

TechCrunch

APRIL 20, 2023

Venturo, a hobbyist Ethereum miner, cheaply acquired GPUs from insolvent cryptocurrency mining farms, choosing Nvidia hardware for the increased memory (hence Nvidia’s investment in CoreWeave, presumably). ” Them’s fighting words, to be sure, especially as AWS launches a dedicated service for serving text-generating models.

Artificial Inteligence

Artificial Inteligence Cloud Generative AI Google Cloud

Putting AI to Work: Generative AI Meets the Enterprise

CIO

APRIL 7, 2023

Generative AI (GenAI), the basis for tools like OpenAI ChatGPT, Google Bard and Meta LLaMa, is a new AI technology that has quickly moved front and center into the global limelight. Five days after its launch, ChatGPT exceeded 1 million users 1. To find out more visit our website.

Artificial Inteligence

Artificial Inteligence Generative AI Enterprise Meeting

Unlocking Innovation: AWS and Anthropic push the boundaries of generative AI together

AWS Machine Learning - AI

MARCH 4, 2024

Amazon Bedrock is the best place to build and scale generative AI applications with large language models (LLM) and other foundation models (FMs). It enables customers to leverage a variety of high-performing FMs, such as the Claude family of models by Anthropic, to build custom generative AI applications.

Generative AI

Generative AI AWS Artificial Inteligence Innovation

From edge to cloud: The critical role of hardware in AI applications

CIO

JUNE 6, 2023

The artwork I received was not only visually stunning but also showed how AI is capable of bringing new ideas to life. I was experiencing first-hand, as a creator, the transformative nature of generative AI with Midjourney, chatGPT, and other tools. Midjourney AI is quickly becoming ubiquitous now.

Hardware

Hardware Artificial Inteligence Applications Wireless

Host concurrent LLMs with LoRAX

AWS Machine Learning - AI

APRIL 16, 2025

The increased usage of generative AI models has offered tailored experiences with minimal technical expertise, and organizations are increasingly using these powerful models to drive innovation and enhance their services across various domains, from natural language processing (NLP) to content generation.

Artificial Inteligence

Artificial Inteligence Generative AI AWS Storage

Optimize hosting DeepSeek-R1 distilled models with Hugging Face TGI on Amazon SageMaker AI

AWS Machine Learning - AI

MARCH 13, 2025

There are additional optional runtime parameters that are already pre-optimized in TGI containers to maximize performance on host hardware. We didnt try to optimize the performance for each model/hardware/use case combination. All models were run with dtype=bfloat16. Short-length test 512 input tokens, 256 output tokens.

Artificial Inteligence

Artificial Inteligence AWS Machine Learning Load Balancer

How Cisco accelerated the use of generative AI with Amazon SageMaker Inference

AWS Machine Learning - AI

AUGUST 8, 2024

Webex’s focus on delivering inclusive collaboration experiences fuels their innovation, which uses artificial intelligence (AI) and machine learning (ML), to remove the barriers of geography, language, personality, and familiarity with technology. Its solutions are underpinned with security and privacy by design.

Generative AI

Generative AI Artificial Inteligence AWS Machine Learning

Learn how Amazon Ads created a generative AI-powered image generation capability using Amazon SageMaker

AWS Machine Learning - AI

MAY 15, 2024

To help advertisers more seamlessly address this challenge, Amazon Ads rolled out an image generation capability that quickly and easily develops lifestyle imagery, which helps advertisers bring their brand stories to life. We end with lessons learned. Watch this presentation to learn how you can start your project with JumpStart.

Generative AI

Generative AI Artificial Inteligence Advertising Technical Review

The Rise Of AI-Powered Robotics, And The Future Of Work

Ooda Loop

APRIL 16, 2025

While ChatGPT and generative AI dominate headlines, a quieter revolution is unfolding in AI-powered robotics, transforming businesses and reshaping industries. Far from science fiction, these intelligent machines are automating tasks, boosting efficiency, and sparking debates about their impact on jobs.

Artificial Inteligence

Artificial Inteligence Artificial Intelligence ChatGPT Generative AI

7 tech trends that have changed the tech landscape in 2023

CIO

DECEMBER 13, 2023

Generative AI and Foundational Models – Building on applied AI and industrializing machine learning, generative AI has emerged as a powerful force across industries. – It takes assistive technology to new heights, reducing application development time and empowering non-technical users. – Generative AI is expected to contribute up to $4.4

Generative AI

Generative AI Trends Artificial Inteligence Machine Learning

Best practices to build generative AI applications on AWS

AWS Machine Learning - AI

MARCH 14, 2024

Generative AI applications driven by foundational models (FMs) are enabling organizations with significant business value in customer experience, productivity, process optimization, and innovations. In this post, we explore different approaches you can take when building applications that use generative AI.

Generative AI

Generative AI AWS Applications Artificial Inteligence

AI on the mainframe? IBM may be onto something

CIO

OCTOBER 3, 2024

Rather than pull away from big iron in the AI era, Big Blue is leaning into it, with plans in 2025 to release its next-generation Z mainframe , with a Telum II processor and Spyre AI Accelerator Card, positioned to run large language models (LLMs) and machine learning models for fraud detection and other use cases.

Artificial Inteligence

Artificial Inteligence Generative AI Machine Learning Enterprise

The AI continuum

CIO

JANUARY 24, 2024

ChatGPT has turned everything we know about AI on its head. AI encompasses many things. Generative AI and large language models (LLMs) like ChatGPT are only one aspect of AI. But it’s the well-known part of AI. The price-performance value of consuming AI via the tools you already use is hard to beat.

Artificial Inteligence

Artificial Inteligence Generative AI ChatGPT Machine Learning

Welcome to a New Era of Building in the Cloud with Generative AI on AWS

AWS Machine Learning - AI

NOVEMBER 30, 2023

We believe generative AI has the potential over time to transform virtually every customer experience we know. Innovative startups like Perplexity AI are going all in on AWS for generative AI. And at the top layer, we’ve been investing in game-changing applications in key areas like generative AI-based coding.

Generative AI

Generative AI AWS Artificial Inteligence Software Review

3 steps to get your data AI ready

CIO

MARCH 26, 2025

AI-ready data is not something CIOs need to produce for just one application theyll need it for all applications that require enterprise-specific intelligence. Unfortunately, many IT leaders are discovering that this goal cant be reached using standard data practices, and traditional IT hardware and software.

CTO

CTO Data Infrastructure Artificial Inteligence

Serving LLMs using vLLM and Amazon EC2 instances with AWS AI chips

AWS Machine Learning - AI

NOVEMBER 26, 2024

The use of large language models (LLMs) and generative AI has exploded over the last year. max-num-seqs 32 : This is set to the hardware batch size or a desired level of concurrency that the model server needs to handle. block-size 8 : For neuron devices, this is internally set to the max-model-len. --max-num-seqs

Artificial Inteligence

Artificial Inteligence AWS Artificial Intelligence Generative AI

Revolutionizing customer service: MaestroQA’s integration with Amazon Bedrock for actionable insight

AWS Machine Learning - AI

MARCH 13, 2025

Amazon Bedrock is a fully managed service that offers a choice of high-performing foundation models (FMs) from leading AI companies, such as AI21 Labs, Anthropic, Cohere, Meta, Mistral, Stability AI, and Amazon through a single API, along with a broad set of capabilities to build generative AI applications with security, privacy, and responsible AI.

Generative AI

Generative AI CTO Coach AWS Artificial Inteligence

Beyond ChatGPT: Secret robotics plans and the $38 billion humanoid revolution

CIO

MARCH 20, 2025

Amid this AI arms race, OpenAIs latest trademark application with the United States Patent and Trademark Office (USPTO) shows that the organization has other goals beyond LLMs. The application lists various hardware such as AI-powered smart devices, augmented and virtual reality headsets, and even humanoid robots.

Artificial Inteligence

Artificial Inteligence ChatGPT Artificial Intelligence Hardware

Efficiently train models with large sequence lengths using Amazon SageMaker model parallel

AWS Machine Learning - AI

NOVEMBER 27, 2024

Launching a machine learning (ML) training cluster with Amazon SageMaker training jobs is a seamless process that begins with a straightforward API call, AWS Command Line Interface (AWS CLI) command, or AWS SDK interaction. About the Authors Kanwaljit Khurmi is a Principal Worldwide Generative AI Solutions Architect at AWS.

Training

Training Artificial Inteligence AWS Machine Learning

Intel spins off enterprise AI company Articul8 with outside funding

CIO

JANUARY 4, 2024

Intel has set up a new company, Articul8 AI, to sell enterprise generative AI software it developed. Articul8 AI will be led by Arun Subramaniyan, formerly vice president and general manager in Intel’s Data Center and AI Group. AMD too has been building up the software component of its AI stack.

Enterprise

Enterprise Company Generative AI Telecommunications

10 highest-paying IT skills for 2024

CIO

APRIL 12, 2024

These roles include data scientist, machine learning engineer, software engineer, research scientist, full-stack developer, deep learning engineer, software architect, and field programmable gate array (FPGA) engineer. It is used to execute and improve machine learning tasks such as NLP, computer vision, and deep learning.

Open Source

Open Source Artificial Inteligence Machine Learning Generative AI

Generative AI foundation model training on Amazon SageMaker

AWS Machine Learning - AI

OCTOBER 22, 2024

Business challenge Businesses today face numerous challenges in effectively implementing and managing machine learning (ML) initiatives. Additionally, organizations must navigate cost optimization, maintain data security and compliance, and democratize both ease of use and access of machine learning tools across teams.

Generative AI

Generative AI Training Artificial Inteligence Technical Advisors

Does your SMB have the foundation in place for GenAI?

CIO

OCTOBER 4, 2024

The extraordinary potential of generative AI (GenAI) has seen businesses scrambling to adopt the technology and realize untapped opportunities. But building an AI strategy is more than just deploying the newest GenAI tools. At the same time, companies are free to scale dynamically their use based on business needs.

SMB

SMB Hardware Generative AI Artificial Inteligence

How enterprises can navigate ethics and responsibility of generative AI

CIO

APRIL 27, 2023

In a few short months, generative AI has become a very hot topic. Looking beyond the hype, generative AI is a groundbreaking technology, enabling novel capabilities as it moves rapidly into the enterprise world. Here are ways to proactively preserve trust in generative AI implementations.

Generative AI

Generative AI Enterprise Artificial Inteligence Insurance

Modular secures $100M to build tools to optimize and create AI models

TechCrunch

AUGUST 24, 2023

Modular , a startup creating a platform for developing and optimizing AI systems, has raised $100 million in a funding round led by General Catalyst with participation from GV (Google Ventures), SV Angel, Greylock and Factory. times faster versus on their native frameworks, Lattner claims. . ” Ambitious much?

Tools

Tools Technical Cofounder Hardware Machine Learning

Securing AI Infrastructure for a More Resilient Future

Palo Alto Networks

OCTOBER 30, 2024

Indeed, many of the same governments that are actively developing broad, risk-based, AI regulatory frameworks have concurrently established AI safety institutes to conduct research and facilitate a technical approach to increasing AI system resilience.

Security

Security Artificial Inteligence Infrastructure Government

Hungry for resources, AI redefines the data center calculus

CIO

AUGUST 2, 2024

The AI revolution is driving demand for massive computing power and creating a data center shortage, with data center operators planning to build more facilities. But it’s time for data centers and other organizations with large compute needs to consider hardware replacement as another option, some experts say.

Data Center

Data Center Resources Data Hardware

Top 6 Annotation Tools for HITL LLMs Evaluation and Domain-Specific AI Model Training

John Snow Labs

APRIL 29, 2025

In the era of large language models (LLMs)where generative AI can write, summarize, translate, and even reason across complex documentsthe function of data annotation has shifted dramatically. What was once a preparatory task for training AI is now a core part of a continuous feedback and improvement cycle.

Artificial Inteligence

Artificial Inteligence Training Tools Generative AI

The case for predictive AI

CIO

OCTOBER 16, 2023

According to Accenture , nearly 75% of companies have already integrated AI into their business strategies, and 42% said that the return on their AI initiatives exceeded their expectations (only 1% said the return didn’t meet expectations). To learn how Rocket Software can help you modernize without disruption, click here.

Artificial Inteligence

Artificial Inteligence Artificial Intelligence Machine Learning Hardware

This week in AI: Amazon ‘enhances’ reviews with AI while Snap’s goes rogue

TechCrunch

AUGUST 19, 2023

So until an AI can do it for you, here’s a handy roundup of the last week’s stories in the world of machine learning, along with notable research and experiments we didn’t cover on their own. This week in AI, Amazon announced that it’ll begin tapping generative AI to “enhance” product reviews.

Systems Review

Systems Review Software Review Artificial Inteligence Weak Development Team

Supercharge your auto scaling for generative AI inference – Introducing Container Caching in SageMaker Inference

OctoML raises $85M for it for its machine learning acceleration platform

Webinars

Trending Sources

Dulling the impact of AI-fueled cyber threats with AI

Webinars

Stability AI backs effort to bring machine learning to biomed

Reduce ML training costs with Amazon SageMaker HyperPod

Gartner projects major IT spending increases for 2025

9 IT skills where expertise pays the most

Together raises $20M to build open source generative AI models

Timekettle’s $699 translation hardware handles multiple languages at once

Preparing the foundations for Generative AI

AI agents loom large as organizations pursue generative AI value

Deploy DeepSeek-R1 Distilled Llama models in Amazon Bedrock

Understanding prompt engineering: Unlock the creative potential of Stability AI models on AWS

A secure approach to generative AI with AWS

CoreWeave, a GPU-focused cloud compute provider, lands $221M investment

Putting AI to Work: Generative AI Meets the Enterprise

Unlocking Innovation: AWS and Anthropic push the boundaries of generative AI together

From edge to cloud: The critical role of hardware in AI applications

Host concurrent LLMs with LoRAX

Optimize hosting DeepSeek-R1 distilled models with Hugging Face TGI on Amazon SageMaker AI

How Cisco accelerated the use of generative AI with Amazon SageMaker Inference

Learn how Amazon Ads created a generative AI-powered image generation capability using Amazon SageMaker

The Rise Of AI-Powered Robotics, And The Future Of Work

7 tech trends that have changed the tech landscape in 2023

Best practices to build generative AI applications on AWS

AI on the mainframe? IBM may be onto something

The AI continuum

Welcome to a New Era of Building in the Cloud with Generative AI on AWS

3 steps to get your data AI ready

Serving LLMs using vLLM and Amazon EC2 instances with AWS AI chips

Revolutionizing customer service: MaestroQA’s integration with Amazon Bedrock for actionable insight

Beyond ChatGPT: Secret robotics plans and the $38 billion humanoid revolution

Efficiently train models with large sequence lengths using Amazon SageMaker model parallel

Intel spins off enterprise AI company Articul8 with outside funding

10 highest-paying IT skills for 2024

Generative AI foundation model training on Amazon SageMaker

Does your SMB have the foundation in place for GenAI?

How enterprises can navigate ethics and responsibility of generative AI

Modular secures $100M to build tools to optimize and create AI models

Securing AI Infrastructure for a More Resilient Future

Hungry for resources, AI redefines the data center calculus

Top 6 Annotation Tools for HITL LLMs Evaluation and Domain-Specific AI Model Training

The case for predictive AI

This week in AI: Amazon ‘enhances’ reviews with AI while Snap’s goes rogue

Stay Connected