Enterprise architects ensure systems are performing at their best, with mechanisms to identify opportunities for optimizations that reduce cost, improve efficiency, and ensure scalability. Aggregated TCO: evaluating the total cost across hardware, software, services, and operational expenditures is key.
While many organizations have made this move, they still need professionals to stay on top of cloud services and manage large datasets. Containerization enables developers to create consistent virtual environments to run applications, while also allowing them to build more scalable and secure applications via portable containers.
EnCharge AI, a company building hardware to accelerate AI processing at the edge, today emerged from stealth with $21.7 million in funding. Speaking to TechCrunch via email, co-founder and CEO Naveen Verma said that the proceeds will be put toward hardware and software development as well as supporting new customer engagements.
Unlike conventional chips, theirs was destined for devices at the edge, particularly those running AI workloads, because Del Maffeo and the rest of the team perceived that most offline, at-the-edge computing hardware was inefficient and expensive. The edge AI hardware market is projected to grow from 920 million units in 2021 to 2.08 billion units.
A modern data and artificial intelligence (AI) platform running on scalable processors can handle diverse analytics workloads and speed data retrieval, delivering deeper insights to empower strategic decision-making. Intel’s cloud-optimized hardware accelerates AI workloads, while SAS provides scalable, AI-driven solutions.
Technology leaders in the financial services sector constantly struggle with the daily challenges of balancing cost, performance, and security; the constant demand for high availability means that even a minor system outage could lead to significant financial and reputational losses. Scalability. Cost forecasting.
Cost-performance optimizations via a new chip: one of the major updates announced last week was Google's seventh-generation Tensor Processing Unit (TPU) chip, Ironwood, targeted at accelerating AI workloads, especially inferencing.
These issues can hinder AI scalability and limit its benefits. Why the ideal time to shift to AI PCs is now: with Windows 10 nearing end-of-support, businesses must decide whether to update their existing hardware or upgrade completely when shifting to Windows 11. Fortunately, a solution is at hand.
Core challenges for sovereign AI: resource constraints. Developing and maintaining sovereign AI systems requires significant investments in infrastructure, including hardware (e.g., high-performance computing GPUs), data centers, and energy.
And if the Blackwell specs on paper hold up in reality, the new GPU gives Nvidia AI-focused performance that its competitors can't match, says Alvin Nguyen, a senior analyst of enterprise architecture at Forrester Research. “You can have effective basic performance, but you still have that long-term scalability issue,” he says.
In December, reports suggested that Microsoft had acquired Fungible, a startup fabricating a type of data center hardware known as a data processing unit (DPU), for around $190 million. A DPU is a dedicated piece of hardware designed to handle certain data processing tasks, including security and network routing for data traffic.
There is an increasing need for scalable, reliable, and cost-effective solutions to deploy and serve these models. AWS Trainium and AWS Inferentia based instances, combined with Amazon Elastic Kubernetes Service (Amazon EKS), provide a performant, low-cost framework to run LLMs efficiently in a containerized environment.
For the foreseeable future, global markets will require billions of highly specialized electric machines that perform much better than the inefficient relics of the past. Initially, we approached this as a hardware challenge until we determined that the key to meeting next-generation electric motor demand actually lies in software.
The startup has no intention of building hardware for its auto clients (though it does work with robotics companies for whom the company does manufacture sensors, a company spokesperson said). Software fundamentally improves with better hardware in each generation that’s released.
For generative AI models requiring multiple instances to handle high-throughput inference requests, this added significant overhead to the total scaling time, potentially impacting application performance during traffic spikes. We ran 5+ scaling simulations and observed consistent performance with low variations across trials.
Some are relying on outmoded legacy hardware systems. Most have been so drawn to the excitement of AI software tools that they missed out on selecting the right hardware. Dealing with data is where core technologies and hardware prove essential. An organization’s data, applications and critical systems must be protected.
One of the top problems facing device manufacturers today is overheating hardware. The chips inside PCs generate heat, which, when allowed to build up, majorly hurts performance. This means consumers never really get the full processor performance they pay for.
The following figure illustrates the performance of DeepSeek-R1 compared to other state-of-the-art models on standard benchmark tests, such as MATH-500, MMLU, and more. SM_NUM_GPUS: this parameter specifies the number of GPUs to use for model inference, allowing the model to be sharded across multiple GPUs for improved performance.
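As a rough sketch of how SM_NUM_GPUS is typically wired up when deploying an LLM serving container with the SageMaker Python SDK; the model ID, container, and instance type below are illustrative assumptions, not details from the excerpt:

```python
import json

import sagemaker
from sagemaker.huggingface import HuggingFaceModel, get_huggingface_llm_image_uri

role = sagemaker.get_execution_role()  # assumes a SageMaker execution role is available

model = HuggingFaceModel(
    role=role,
    image_uri=get_huggingface_llm_image_uri("huggingface"),  # LLM serving container
    env={
        "HF_MODEL_ID": "deepseek-ai/DeepSeek-R1-Distill-Llama-8B",  # assumed model
        "SM_NUM_GPUS": json.dumps(4),  # shard the model across 4 GPUs
    },
)

predictor = model.deploy(
    initial_instance_count=1,
    instance_type="ml.g5.12xlarge",  # a 4-GPU instance type (assumption)
)
print(predictor.predict({"inputs": "Explain tensor parallelism in one sentence."}))
```

The SM_NUM_GPUS value should match the GPU count of the chosen instance type so the container can shard the weights across all available devices.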
Pliops' processors are engineered to boost the performance of databases and other apps that run on flash memory, saving money in the long run, he claims. “While CPU performance is increasing, it's not keeping up, especially where accelerated performance is critical.”
Rigetti Computing , one of the most visible quantum hardware startups, today announced that it is going public through a merger with the Supernova Partners Acquisition Company II SPAC. Once the transaction closes, Rigetti’s ticker symbol on the New York Stock Exchange will be “RGTI.”
Utilizing standard 2U servers outfitted with a robust set of specifications ensures the reliability and performance needed for critical operations. This architecture integrates a strategic assembly of server types across 10 racks to ensure peak performance and scalability.
Their DeepSeek-R1 models represent a family of large language models (LLMs) designed to handle a wide range of tasks, from code generation to general reasoning, while maintaining competitive performance and efficiency. Distilled variants (e.g., 70B-Instruct) offer different trade-offs between performance and resource requirements.
How does High-Performance Computing on AWS differ from regular computing? Today's server hardware is powerful enough to execute most compute tasks. Beyond this, HPC brings massive parallel computing, cluster and workload managers, and high-performance components to the table. Why are HPC and cloud a good fit?
According to a recent Skillable survey of over 1,000 IT professionals, it’s highly likely that your IT training isn’t translating into job performance. Four in 10 IT workers say that the learning opportunities offered by their employers don’t improve their job performance. The team turned to virtual IT labs as an alternative.
Ruby on Rails has long been a favorite for building scalable applications quickly, and with the release of Rails 8, it's even more powerful. Ruby on Rails 8 introduces a range of hidden features that dramatically reduce development time while improving performance and flexibility. That's why choosing the right tech stack is crucial.
In this post, we explore advanced prompt engineering techniques that can enhance the performance of these models and facilitate the creation of compelling imagery through text-to-image transformations. This post provided practical tips and techniques to optimize performance and elevate the creative possibilities within Stable Diffusion 3.5.
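For context, a text-to-image call to a Stability model through Amazon Bedrock generally follows the sketch below; the request fields match Stability's published Bedrock schema, but the exact Stable Diffusion 3.5 model ID is an assumption and may differ by region:

```python
import base64
import json

import boto3

client = boto3.client("bedrock-runtime", region_name="us-west-2")

body = json.dumps({
    "prompt": "a photorealistic red fox in fresh snow, golden hour lighting",
    "mode": "text-to-image",
    "aspect_ratio": "16:9",
    "output_format": "png",
})

# Model ID is an assumption for SD 3.5 Large on Bedrock.
response = client.invoke_model(modelId="stability.sd3-5-large-v1:0", body=body)
payload = json.loads(response["body"].read())

with open("fox.png", "wb") as f:
    f.write(base64.b64decode(payload["images"][0]))
```

Prompt engineering for this model mostly happens in the prompt (and, where supported, negative_prompt) strings; the rest of the request stays the same.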
Bodo.ai , a parallel compute platform for data workloads, is developing a compiler to make Python portable and efficient across multiple hardware platforms. Bodo.ai, headquartered in San Francisco, was founded in 2019 by Nasre and Ehsan Totoni, CTO, to make Python higher performing and production ready.
Amazon Bedrock is a fully managed service that offers a choice of high-performing foundation models (FMs) from leading AI companies, such as AI21 Labs, Anthropic, Cohere, Meta, Mistral, Stability AI, and Amazon through a single API, along with a broad set of capabilities to build generative AI applications with security, privacy, and responsible AI.
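The "single API" point is concrete: the same Converse request shape works across the listed providers, so switching models is a one-line change. A minimal boto3 sketch (the model ID is just one example):

```python
import boto3

client = boto3.client("bedrock-runtime", region_name="us-east-1")

response = client.converse(
    modelId="anthropic.claude-3-haiku-20240307-v1:0",  # swap IDs to change providers
    messages=[{"role": "user", "content": [{"text": "Summarize DPUs in one line."}]}],
    inferenceConfig={"maxTokens": 256, "temperature": 0.5},
)
print(response["output"]["message"]["content"][0]["text"])
```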
Cloudera sees success in terms of two very simple outputs or results: building enterprise agility and enterprise scalability. In the last five years, there has been a meaningful investment in both Edge hardware compute power and software analytical capabilities. Let's start at the place where much of Industry 4.0
Bringing Modular’s total raised to $130 million, the proceeds will be put toward product expansion, hardware support and the expansion of Modular’s programming language, Mojo, CEO Chris Lattner says. Deci , backed by Intel, is among the startups offering tech to make trained AI models more efficient — and performant.
Amazon Bedrock Model Distillation is generally available, and it addresses the fundamental challenge many organizations face when deploying generative AI: how to maintain high performance while reducing costs and latency. This provides optimal performance by maintaining the same structure the model was trained on.
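A distillation job is started through Bedrock's standard model-customization API with customizationType="DISTILLATION". The sketch below follows the documented boto3 request shape, but every ARN, bucket path, and model identifier is a placeholder assumption:

```python
import boto3

bedrock = boto3.client("bedrock", region_name="us-east-1")

bedrock.create_model_customization_job(
    jobName="distill-demo-job",
    customModelName="my-distilled-model",
    roleArn="arn:aws:iam::123456789012:role/BedrockDistillRole",  # placeholder
    customizationType="DISTILLATION",
    # Smaller student model to be trained on the teacher's outputs (assumed ID).
    baseModelIdentifier="arn:aws:bedrock:us-east-1::foundation-model/amazon.nova-micro-v1:0",
    customizationConfig={
        "distillationConfig": {
            "teacherModelConfig": {
                # Larger teacher model whose behavior is distilled (assumed ID).
                "teacherModelIdentifier": "arn:aws:bedrock:us-east-1::foundation-model/amazon.nova-pro-v1:0",
                "maxResponseLengthForInference": 1000,
            }
        }
    },
    trainingDataConfig={"s3Uri": "s3://my-bucket/prompts.jsonl"},  # placeholder
    outputDataConfig={"s3Uri": "s3://my-bucket/distill-output/"},  # placeholder
)
```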
“Off-the-shelf hardware is more reliable and proven than custom-engineered hardware, and our software is drone-agnostic, so we can source from a large supply chain with no factory to manage.” Gather isn't the first to market with a drone-based inventory monitoring system.
This includes Dell Data Lakehouse for AI, a data platform built upon Dell’s AI-optimized hardware, and a full-stack software suite for discovering, querying, and processing enterprise data. In particular, Dell PowerScale provides a scalable storage platform for driving faster AI innovations.
The other major change was beginning to rely on hardware acceleration of said codecs: your computer or GPU might have an actual chip in it with the codec baked in, ready to perform decompression tasks with far greater speed than an ordinary general-purpose CPU in a phone.
“With its deep AI and HPC [High Performance Computing] domain knowledge and enterprise-grade GenAI deployments, Articul8 is well positioned to deliver tangible business outcomes for Intel and our broader ecosystem of customers and partners,” Intel CEO Pat Gelsinger said in a news release.
At InnovationM, we are constantly searching for tools and technologies that can drive the performance and scalability of our AI-driven products. Recently, we made progress with vLLM, a high-performance model inference engine designed to deploy Large Language Models (LLMs) more efficiently. We had a defined challenge.
Aptiv comes on as a strategic investor at a time when the company is working on accelerating the transition to the software-defined car by offering a complete stack to automakers, one that includes high-performance hardware, cloud connectivity, and a software architecture that is open, scalable, and containerized.
For example, DeepSeek-R1-Distill-Llama-8B offers an excellent balance of performance and efficiency. By integrating this model with Amazon SageMaker AI, you can benefit from AWS's scalable infrastructure while maintaining high-quality language model capabilities.
Using vLLM on AWS Trainium and Inferentia makes it possible to host LLMs for high-performance inference and scalability. We will also talk about performance tuning the inference graph. max-num-seqs 32: this is set to the hardware batch size or a desired level of concurrency that the model server needs to handle.
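The max-num-seqs server flag corresponds to the max_num_seqs engine argument in vLLM's Python API. A minimal sketch targeting the Neuron backend follows; the model, device, and parallelism degree are assumptions:

```python
from vllm import LLM, SamplingParams

llm = LLM(
    model="deepseek-ai/DeepSeek-R1-Distill-Llama-8B",  # assumed model
    device="neuron",          # route to AWS Trainium/Inferentia via the Neuron backend
    tensor_parallel_size=2,   # shard across 2 NeuronCores (assumption)
    max_num_seqs=32,          # hardware batch size / concurrency, per the excerpt
    max_model_len=4096,
)

outputs = llm.generate(
    ["Explain KV caching in one sentence."],
    SamplingParams(max_tokens=128, temperature=0.7),
)
print(outputs[0].outputs[0].text)
```

On Neuron, graphs are compiled for fixed batch sizes, so max_num_seqs is typically pinned to the batch size the inference graph was compiled for; raising it beyond that can trigger recompilation or out-of-memory errors.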
Having a distributed and scalable graph database system is highly sought after in many enterprise scenarios. Do not be misled: designing and implementing a scalable graph database system has never been a trivial task.
The promise of lower hardware costs has spurred startups to migrate services to the cloud, but many teams were unsure how to do this efficiently or cost-effectively. These companies are worried about the future of their cloud infrastructure in terms of security, scalability and maintainability.
These models are tailored to perform specialized tasks within specific domains or micro-domains. This challenge is further compounded by concerns over scalability and cost-effectiveness. Customers can host the different variants on a single EC2 instance instead of a fleet of model endpoints, saving costs without impacting performance.
Driving the High Performance of the InfiniBox SSA™: think the highest-performance model that makes people's heads turn. It is powered by Infinidat's proven deep learning software algorithms and extensive DRAM cache, consistently delivering performance and latency results that surpass all-flash arrays (AFAs).
To accelerate iteration and innovation in this field, sufficient computing resources and a scalable platform are essential. With these capabilities, customers are adopting SageMaker HyperPod as their innovation platform for more resilient and performant model training, enabling them to build state-of-the-art models faster.