Architecture, AWS and Metrics

Building Resilient Public Networking on AWS: Part 4

Xebia

OCTOBER 23, 2024

Region Evacuation with static anycast IP approach Welcome back to our comprehensive "Building Resilient Public Networking on AWS" blog series, where we delve into advanced networking strategies for regional evacuation, failover, and robust disaster recovery. Find the detailed guide here.

AWS

AWS Network Software Review Lambda

Can serverless fix fintech’s scaling problem?

CIO

FEBRUARY 11, 2025

The built-in elasticity in serverless computing architecture makes it particularly appealing for unpredictable workloads and amplifies developers productivity by letting developers focus on writing code and optimizing application design industry benchmarks , providing additional justification for this hypothesis. Architecture complexity.

Serverless

Serverless Architecture Microservices Scalability

Principal Financial Group uses QnABot on AWS and Amazon Q Business to enhance workforce productivity with generative AI

AWS Machine Learning - AI

NOVEMBER 15, 2024

With the QnABot on AWS (QnABot), integrated with Microsoft Azure Entra ID access controls, Principal launched an intelligent self-service solution rooted in generative AI. Principal also used the AWS open source repository Lex Web UI to build a frontend chat interface with Principal branding.

Generative AI

Generative AI AWS Groups Artificial Inteligence

Webinars

Automation, Evolved: Your New Playbook For Smarter Knowledge Work

MORE WEBINARS

Build a multi-tenant generative AI environment for your enterprise on AWS

AWS Machine Learning - AI

NOVEMBER 7, 2024

It also uses a number of other AWS services such as Amazon API Gateway , AWS Lambda , and Amazon SageMaker. You can use AWS services such as Application Load Balancer to implement this approach. You can also bring your own customized models and deploy them to Amazon Bedrock for supported architectures.

Generative AI

Generative AI AWS Enterprise Artificial Inteligence

Monitoring AWS Container Environments at Scale

Advertiser: Datadog

Particularly well-suited for microservice-oriented architectures and agile workflows, containers help organizations improve developer efficiency, feature velocity, and optimization of resources. Key metrics to monitor when leveraging two container orchestration systems.

AWS

Why GreenOps will succeed where FinOps is failing

CIO

FEBRUARY 4, 2025

The result was a compromised availability architecture. For example, the database team we worked with in an organization new to the cloud launched all the AWS RDS database servers from dev through production, incurring a $600K a month cloud bill nine months before the scheduled production launch. Standardized metrics.

Sustainability

Sustainability Technical Review Architecture Fractional CTO

Build a reverse image search engine with Amazon Titan Multimodal Embeddings in Amazon Bedrock and AWS managed services

AWS Machine Learning - AI

NOVEMBER 13, 2024

The following diagram illustrates the solution architecture: The steps of the solution include: Upload data to Amazon S3 : Store the product images in Amazon Simple Storage Service (Amazon S3). The AWS Command Line Interface (AWS CLI) installed on your machine to upload the dataset to Amazon S3.

AWS

AWS Engineering Serverless eCommerce

Navigating the cloud maze: A 5-phase approach to optimizing cloud strategies

CIO

JANUARY 7, 2025

It prevents vendor lock-in, gives a lever for strong negotiation, enables business flexibility in strategy execution owing to complicated architecture or regional limitations in terms of security and legal compliance if and when they rise and promotes portability from an application architecture perspective.

Cloud

Cloud Strategy Architecture Policies

Empower your generative AI application with a comprehensive custom observability solution

AWS Machine Learning - AI

OCTOBER 29, 2024

Observability refers to the ability to understand the internal state and behavior of a system by analyzing its outputs, logs, and metrics. Security – The solution uses AWS services and adheres to AWS Cloud Security best practices so your data remains within your AWS account.

Generative AI

Generative AI Applications AWS Knowledge Base

How DPG Media uses Amazon Bedrock and Amazon Transcribe to enhance video metadata with AI-powered pipelines

AWS Machine Learning - AI

OCTOBER 16, 2024

The general architecture of the metadata pipeline consists of two primary steps: Generate transcriptions of audio tracks: use speech recognition models to generate accurate transcripts of the audio content. Word information lost (WIL) – This metric quantifies the amount of information lost due to transcription errors.

Media

Media Video Artificial Inteligence Generative AI

Ardoq, the enterprise architecture startup, raises $125M to help organizations make sense of their networks

TechCrunch

MARCH 9, 2022

As organizations continue to build out their digital architecture, a new category of enterprise software has emerged to help them manage that process. “Enterprise architecture today is very much about the scaffolding in the organization,” he said. This means that you can also then run, for example, scenario analysis.

Architecture

Architecture Enterprise Network Organization

Deploy DeepSeek-R1 Distilled Llama models in Amazon Bedrock

AWS Machine Learning - AI

JANUARY 29, 2025

In this post, we explore how to deploy distilled versions of DeepSeek-R1 with Amazon Bedrock Custom Model Import, making them accessible to organizations looking to use state-of-the-art AI capabilities within the secure and scalable AWS infrastructure at an effective cost. Review the model response and metrics provided.

Generative AI

Generative AI Artificial Inteligence AWS Serverless

Track, allocate, and manage your generative AI cost and usage with Amazon Bedrock

AWS Machine Learning - AI

NOVEMBER 1, 2024

Organizations can now label all Amazon Bedrock models with AWS cost allocation tags , aligning usage to specific organizational taxonomies such as cost centers, business units, and applications. By assigning AWS cost allocation tags, the organization can effectively monitor and track their Bedrock spend patterns.

Generative AI

Generative AI AWS Artificial Inteligence Budget

AWS launches no-code service AppFabric with generative AI assistance

CIO

JUNE 28, 2023

Amazon Web Services (AWS) on Tuesday unveiled a new no-code offering, dubbed AppFabric, designed to simplify SaaS integration for enterprises by increasing application observability and reducing operational costs associated with building point-to-point solutions. AppFabric, which is available across AWS’ US East (N.

Generative AI

Generative AI AWS Artificial Inteligence Enterprise

Reduce conversational AI response time through inference at the edge with AWS Local Zones

AWS Machine Learning - AI

MARCH 3, 2025

Hybrid architecture with AWS Local Zones To minimize the impact of network latency on TTFT for users regardless of their locations, a hybrid architecture can be implemented by extending AWS services from commercial Regions to edge locations closer to end users. Next, create a subnet inside each Local Zone.

AWS

AWS Artificial Inteligence Technical Review Systems Review

Build generative AI applications quickly with Amazon Bedrock IDE in Amazon SageMaker Unified Studio

AWS Machine Learning - AI

DECEMBER 4, 2024

SageMaker Unified Studio combines various AWS services, including Amazon Bedrock , Amazon SageMaker , Amazon Redshift , Amazon Glue , Amazon Athena , and Amazon Managed Workflows for Apache Airflow (MWAA) , into a comprehensive data and AI development platform.

Generative AI

Generative AI Applications Technical Review Software Review

Deep Vision announces its low-latency AI processor for the edge

TechCrunch

NOVEMBER 16, 2020

Hameed and Qadeer developed Deep Vision’s architecture as part of a Ph.D. “They came up with a very compelling architecture for AI that minimizes data movement within the chip,” Annavajjhala explained. In addition, its software optimizes the overall data flow inside the architecture based on the specific workload.

Weak Development Team

Weak Development Team Hardware Architecture Automotive

Revolutionizing customer service: MaestroQA’s integration with Amazon Bedrock for actionable insight

AWS Machine Learning - AI

MARCH 13, 2025

We discuss the unique challenges MaestroQA overcame and how they use AWS to build new features, drive customer insights, and improve operational inefficiencies. MaestroQA integrated Amazon Bedrock into their existing architecture using Amazon Elastic Container Service (Amazon ECS).

Generative AI

Generative AI CTO Coach AWS Artificial Inteligence

Model customization, RAG, or both: A case study with Amazon Nova

AWS Machine Learning - AI

APRIL 10, 2025

Solution overview To evaluate the effectiveness of RAG compared to model customization, we designed a comprehensive testing framework using a set of AWS-specific questions. The following diagram illustrates the solution architecture. To do so, we create a knowledge base. For Job name , enter a name for the fine-tuning job.

Case Study

Case Study Artificial Inteligence Study Generative AI

Build a video insights and summarization engine using generative AI with Amazon Bedrock

AWS Machine Learning - AI

OCTOBER 29, 2024

This engine uses artificial intelligence (AI) and machine learning (ML) services and generative AI on AWS to extract transcripts, produce a summary, and provide a sentiment for the call. All of this data is centralized and can be used to improve metrics in scenarios such as sales or call centers.

Generative AI

Generative AI Video Engineering Artificial Inteligence

Techniques and approaches for monitoring large language models on AWS

AWS Machine Learning - AI

FEBRUARY 26, 2024

Our proposed architecture provides a scalable and customizable solution for online LLM monitoring, enabling teams to tailor your monitoring solution to your specific use cases and requirements. Overview of solution The first thing to consider is that different metrics require different computation considerations.

Artificial Inteligence

Artificial Inteligence AWS Lambda Metrics

How BQA streamlines education quality reporting using Amazon Bedrock

AWS Machine Learning - AI

JANUARY 13, 2025

The collaboration between BQA and AWS was facilitated through the Cloud Innovation Center (CIC) program, a joint initiative by AWS, Tamkeen , and leading universities in Bahrain, including Bahrain Polytechnic and University of Bahrain. The following diagram illustrates the solution architecture.

Education

Education Report Technical Review Generative AI

The AWS Cloud Migration Checklist Every Business Needs for a Smooth Transition

Mobilunity

DECEMBER 26, 2024

For medium to large businesses with outdated systems or on-premises infrastructure, transitioning to AWS can revolutionize their IT operations and enhance their capacity to respond to evolving market needs. AWS migration isnt just about moving data; it requires careful planning and execution. Need to hire skilled engineers?

AWS

AWS Cloud Weak Development Team DevOps

Why Every Engineering Team Should Embrace AWS Graviton4

Honeycomb

JULY 9, 2024

Two years ago, we shared our experiences with adopting AWS Graviton3 and our enthusiasm for the future of AWS Graviton and Arm. Once again, we’re privileged to share our experiences as a launch customer of the Amazon EC2 R8g instances powered by AWS Graviton4, the newest generation of AWS Graviton processors.

AWS

AWS Engineering Metrics Network

Evaluating RAG applications with Amazon Bedrock knowledge base evaluation

AWS Machine Learning - AI

MARCH 14, 2025

Although automated metrics are fast and cost-effective, they can only evaluate the correctness of an AI response, without capturing other evaluation dimensions or providing explanations of why an answer is problematic. Human evaluation, although thorough, is time-consuming and expensive at scale.

Knowledge Base

Knowledge Base Applications Artificial Inteligence Generative AI

Boost team productivity with Amazon Q Business Insights

AWS Machine Learning - AI

APRIL 9, 2025

By monitoring utilization metrics, organizations can quantify the actual productivity gains achieved with Amazon Q Business. Tracking metrics such as time saved and number of queries resolved can provide tangible evidence of the services impact on overall workplace productivity.

Weak Development Team

Weak Development Team Metrics AWS Systems Review

Asure’s approach to enhancing their call center experience using generative AI and Amazon Q in Quicksight

AWS Machine Learning - AI

MARCH 20, 2025

Our partnership with AWS and our commitment to be early adopters of innovative technologies like Amazon Bedrock underscore our dedication to making advanced HCM technology accessible for businesses of any size. We are thrilled to partner with AWS on this groundbreaking generative AI project. John Canada, VP of Engineering at Asure.

Generative AI

Generative AI Artificial Inteligence Metrics AWS

Reduce ML training costs with Amazon SageMaker HyperPod

AWS Machine Learning - AI

APRIL 10, 2025

To assess system reliability, engineering teams often rely on key metrics such as mean time between failures (MTBF), which measures the average operational time between hardware failures and serves as a valuable indicator of system robustness.

Training

Training Artificial Inteligence Hardware Systems Review

Streamline RAG applications with intelligent metadata filtering using Amazon Bedrock

AWS Machine Learning - AI

NOVEMBER 20, 2024

To evaluate the effectiveness of a RAG system, we focus on three key metrics: Answer relevancy – Measures how well the generated answer addresses the user’s query. By implementing dynamic metadata filtering, you can significantly improve these metrics, leading to more accurate and relevant RAG responses. model in Amazon Bedrock.

Artificial Inteligence

Artificial Inteligence Applications Knowledge Base Generative AI

Optimize hosting DeepSeek-R1 distilled models with Hugging Face TGI on Amazon SageMaker AI

AWS Machine Learning - AI

MARCH 13, 2025

Model Variants The current DeepSeek model collection consists of the following models: DeepSeek-V3 An LLM that uses a Mixture-of-Experts (MoE) architecture. These models retain their existing architecture while gaining additional reasoning capabilities through a distillation process. deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B

Artificial Inteligence

Artificial Inteligence AWS Machine Learning Load Balancer

Revolutionize trip planning with Amazon Bedrock and Amazon Location Service

AWS Machine Learning - AI

NOVEMBER 14, 2024

This is where AWS and generative AI can revolutionize the way we plan and prepare for our next adventure. This innovative service goes beyond traditional trip planning methods, offering real-time interaction through a chat-based interface and maintaining scalability, reliability, and data security through AWS native services.

Artificial Inteligence

Artificial Inteligence Generative AI Travel AWS

Generative AI operating models in enterprise organizations with Amazon Bedrock

AWS Machine Learning - AI

JANUARY 29, 2025

Large organizations often have many business units with multiple lines of business (LOBs), with a central governing entity, and typically use AWS Organizations with an Amazon Web Services (AWS) multi-account strategy. In this post, we evaluate different generative AI operating model architectures that could be adopted.

Generative AI

Generative AI Organization Enterprise Artificial Inteligence

CBRE and AWS perform natural language queries of structured data using Amazon Bedrock

AWS Machine Learning - AI

MAY 30, 2024

Because Amazon Bedrock is serverless, you don’t have to manage infrastructure, and you can securely integrate and deploy generative AI capabilities into your applications using the AWS services you are already familiar with. AWS Prototyping developed an AWS Cloud Development Kit (AWS CDK) stack for deployment following AWS best practices.

AWS

AWS Lambda Performance Artificial Inteligence

Customize DeepSeek-R1 distilled models using Amazon SageMaker HyperPod recipes – Part 1

AWS Machine Learning - AI

MARCH 3, 2025

Tuning model architecture requires technical expertise, training and fine-tuning parameters, and managing distributed training infrastructure, among others. These recipes are processed through the HyperPod recipe launcher, which serves as the orchestration layer responsible for launching a job on the corresponding architecture.

Artificial Inteligence

Artificial Inteligence Generative AI AWS Training

AWS empowers sales teams using generative AI solution built on Amazon Bedrock

AWS Machine Learning - AI

AUGUST 26, 2024

At AWS, we are transforming our seller and customer journeys by using generative artificial intelligence (AI) across the sales lifecycle. It will be able to answer questions, generate content, and facilitate bidirectional interactions, all while continuously using internal AWS and external data to deliver timely, personalized insights.

Generative AI

Generative AI AWS Artificial Inteligence Technical Review

Unleashing the multimodal power of Amazon Bedrock Data Automation to transform unstructured data into actionable insights

AWS Machine Learning - AI

MARCH 20, 2025

Cross-Region inference enables seamless management of unplanned traffic bursts by using compute across different AWS Regions. Amazon Bedrock Data Automation optimizes for available AWS Regional capacity by automatically routing across regions within the same geographic area to maximize throughput at no additional cost.

Data

Data Generative AI Artificial Inteligence Compliance

Yotascale raises a $13M Series B to help companies track and manage their cloud spends

TechCrunch

OCTOBER 12, 2020

Instead, you farm out your infrastructure needs to the major cloud platforms, namely Amazon AWS , Microsoft Azure and Google Cloud. So spending less on AWS or Azure would be nice for startups. Yotascale wants to add support for Azure and Google Cloud in addition to its AWS work of today, to pick an example.

Cloud

Cloud Company Azure Google Cloud

Your guide to generative AI and ML at AWS re:Invent 2023

AWS Machine Learning - AI

NOVEMBER 22, 2023

Yes, the AWS re:Invent season is upon us and as always, the place to be is Las Vegas! are the sessions dedicated to AWS DeepRacer ! Generative AI is at the heart of the AWS Village this year. You marked your calendars, you booked your hotel, and you even purchased the airfare. And last but not least (and always fun!)

Generative AI

Generative AI Artificial Inteligence AWS Machine Learning

Deploy DeepSeek-R1 distilled models on Amazon SageMaker using a Large Model Inference container

AWS Machine Learning - AI

MARCH 11, 2025

DeepSeek-R1 uses a Mixture of Experts (MoE) architecture and is 671 billion parameters in size. The MoE architecture allows activation of 37 billion parameters, enabling efficient inference by routing queries to the most relevant expert clusters. For details, refer to Create an AWS account.

Artificial Inteligence

Artificial Inteligence AWS Generative AI Metrics

Accelerating insurance policy reviews with generative AI: Verisk’s Mozart companion

AWS Machine Learning - AI

MARCH 7, 2025

In this post, we describe the development journey of the generative AI companion for Mozart, the data, the architecture, and the evaluation of the pipeline. The following diagram illustrates the solution architecture. Data: Policy forms Mozart is designed to author policy forms like coverage and endorsements.

Generative AI

Generative AI Technical Review Insurance Policies

New Honeycomb Integrations Let You Bubble Up Lurking AWS Issues

Honeycomb

NOVEMBER 28, 2022

Today, we’re announcing the expansion of Honeycomb integrations with various AWS services. This update now covers a much wider swath of AWS services, makes it easier to integrate your AWS stack with Honeycomb, and with our new BubbleUp enhancements , you’ll be identifying and debugging hidden issues in your AWS stack faster than ever.

AWS

AWS Metrics Lambda Infrastructure

Ground truth generation and review best practices for evaluating generative AI question-answering with FMEval

AWS Machine Learning - AI

MARCH 5, 2025

These assistants can be powered by various backend architectures including Retrieval Augmented Generation (RAG), agentic workflows, fine-tuned large language models (LLMs), or a combination of these techniques. Generative AI question-answering applications are pushing the boundaries of enterprise productivity.

Generative AI

Generative AI Systems Review Software Review Artificial Inteligence

Microservices Architectural Design by using Spring Boot

Perficient

FEBRUARY 13, 2024

What is Microservices Architecture? Microservices Architecture Software development follows an architectural and organizational approach where small independent services communicate with each other through well-defined APIs. with DevOps tools like Jenkins with CI/CD, Docker, Ansible, Kubernetes, or other tools.

Microservices

Microservices Architecture Load Balancer MVC

The change management Informatica needed to overhaul its business model

CIO

MARCH 6, 2024

We knew we had a unique opportunity to build a new end-to-end architecture with a common AI-powered data model. And while we did put in a new architecture, the objective was to enable a true cloud and consumption business model that was profitable. The architecture was a means to get there. Today, we’re a $1.6 Today, we’re a $1.6

Change Management

Change Management Metrics Architecture Government

Building Resilient Public Networking on AWS: Part 4

Can serverless fix fintech’s scaling problem?

Webinars

Trending Sources

Principal Financial Group uses QnABot on AWS and Amazon Q Business to enhance workforce productivity with generative AI

Webinars

Build a multi-tenant generative AI environment for your enterprise on AWS

Monitoring AWS Container Environments at Scale

Why GreenOps will succeed where FinOps is failing

Build a reverse image search engine with Amazon Titan Multimodal Embeddings in Amazon Bedrock and AWS managed services

Navigating the cloud maze: A 5-phase approach to optimizing cloud strategies

Empower your generative AI application with a comprehensive custom observability solution

How DPG Media uses Amazon Bedrock and Amazon Transcribe to enhance video metadata with AI-powered pipelines

Ardoq, the enterprise architecture startup, raises $125M to help organizations make sense of their networks

Deploy DeepSeek-R1 Distilled Llama models in Amazon Bedrock

Track, allocate, and manage your generative AI cost and usage with Amazon Bedrock

AWS launches no-code service AppFabric with generative AI assistance

Reduce conversational AI response time through inference at the edge with AWS Local Zones

Build generative AI applications quickly with Amazon Bedrock IDE in Amazon SageMaker Unified Studio

Deep Vision announces its low-latency AI processor for the edge

Revolutionizing customer service: MaestroQA’s integration with Amazon Bedrock for actionable insight

Model customization, RAG, or both: A case study with Amazon Nova

Build a video insights and summarization engine using generative AI with Amazon Bedrock

Techniques and approaches for monitoring large language models on AWS

How BQA streamlines education quality reporting using Amazon Bedrock

The AWS Cloud Migration Checklist Every Business Needs for a Smooth Transition

Why Every Engineering Team Should Embrace AWS Graviton4

Evaluating RAG applications with Amazon Bedrock knowledge base evaluation

Boost team productivity with Amazon Q Business Insights

Asure’s approach to enhancing their call center experience using generative AI and Amazon Q in Quicksight

Reduce ML training costs with Amazon SageMaker HyperPod

Streamline RAG applications with intelligent metadata filtering using Amazon Bedrock

Optimize hosting DeepSeek-R1 distilled models with Hugging Face TGI on Amazon SageMaker AI

Revolutionize trip planning with Amazon Bedrock and Amazon Location Service

Generative AI operating models in enterprise organizations with Amazon Bedrock

CBRE and AWS perform natural language queries of structured data using Amazon Bedrock

Customize DeepSeek-R1 distilled models using Amazon SageMaker HyperPod recipes – Part 1

AWS empowers sales teams using generative AI solution built on Amazon Bedrock

Unleashing the multimodal power of Amazon Bedrock Data Automation to transform unstructured data into actionable insights

Yotascale raises a $13M Series B to help companies track and manage their cloud spends

Your guide to generative AI and ML at AWS re:Invent 2023

Deploy DeepSeek-R1 distilled models on Amazon SageMaker using a Large Model Inference container

Accelerating insurance policy reviews with generative AI: Verisk’s Mozart companion

New Honeycomb Integrations Let You Bubble Up Lurking AWS Issues

Ground truth generation and review best practices for evaluating generative AI question-answering with FMEval

Microservices Architectural Design by using Spring Boot

The change management Informatica needed to overhaul its business model

Stay Connected