Meta Llama 3 Optimized CPU Inference with Hugging Face and PyTorch (Towards Data Science, Apr 19). Learn how to reduce model latency when deploying Meta* Llama 3 on CPUs.
Transforming Financial Services with RAG: Personalized Financial Advice (Apr 5). Synthesize the complex web of financial strategies, regulations, and trends with a personalized, RAG-based financial advice chatbot.
Transforming Manufacturing with RAG: Delivering NextGen Equipment Maintenance (Apr 3). Keep operations online with actionable, relevant, and effective maintenance strategies powered by retrieval augmented generation.
Transforming Retail with RAG: The Future of Personalized Shopping (Apr 2). Deliver dynamic, fresh, and timely recommendations to shoppers with retrieval augmented generation.
Improving LLM Inference Latency on CPUs with Model Quantization (Towards Data Science, Feb 29). Discover how to significantly improve inference latency on CPUs using quantization techniques for mixed, int8, and int4 precisions.
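As a flavor of the quantization approach that article covers, here is a minimal sketch using stock PyTorch's dynamic int8 quantization on a toy model; the article itself targets LLMs and additional precisions, and the model and shapes below are illustrative assumptions, not taken from the article.

```python
# Minimal sketch: dynamic int8 quantization with stock PyTorch.
# The toy model below is a stand-in for a real LLM.
import torch
import torch.nn as nn

model = nn.Sequential(nn.Linear(64, 64), nn.ReLU(), nn.Linear(64, 8))
model.eval()

# Replace Linear layers with dynamically quantized int8 equivalents;
# weights are stored in int8 and activations are quantized on the fly.
quantized = torch.quantization.quantize_dynamic(
    model, {nn.Linear}, dtype=torch.qint8
)

x = torch.randn(1, 64)
with torch.no_grad():
    out = quantized(x)
print(out.shape)  # torch.Size([1, 8])
```

Dynamic quantization like this shrinks weight memory roughly 4x versus float32 and can speed up CPU inference for linear-heavy models, at a small accuracy cost.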
Distributed Fine-Tuning of Stable Diffusion with CPUs on AWS (AWS Tip, Dec 20, 2023). Learn how to use Hugging Face* Accelerate on Amazon Web Services (AWS)* to fine-tune Stable Diffusion.
Retrieval Augmented Generation (RAG) Inference Engines with LangChain on CPUs (Towards Data Science, Dec 5, 2023). Exploring scale, fidelity, and latency in AI applications with RAG.
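To illustrate the retrieve-then-generate pattern behind RAG, here is a toy, dependency-free sketch; it is not the article's LangChain implementation. The keyword-overlap scorer stands in for embedding similarity search against a vector store, and the prompt-assembly step stands in for an LLM call.

```python
# Toy RAG sketch: retrieve relevant context, then ground the prompt in it.
docs = [
    "Quantization reduces model precision to speed up CPU inference.",
    "Retrieval augmented generation grounds answers in external documents.",
    "Stable Diffusion can be fine-tuned with Hugging Face Accelerate.",
]

def retrieve(query, k=1):
    # Rank documents by word overlap with the query (a stand-in for
    # embedding similarity search in a real vector store).
    q = set(query.lower().split())
    scored = sorted(docs, key=lambda d: -len(q & set(d.lower().split())))
    return scored[:k]

def answer(query):
    context = " ".join(retrieve(query))
    # A real system would send this grounded prompt to an LLM;
    # here we just return the assembled prompt.
    return f"Context: {context}\nQuestion: {query}"

print(answer("What is retrieval augmented generation?"))
```

A production pipeline swaps in an embedding model, a vector index, and an LLM, but the control flow stays the same: retrieve, assemble context, generate.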
A Case for Operational-Centric AI (Nov 2, 2023). Proposing a model for understanding the evolution of AI from the perspective of engineering resource investment.
Fast Prototyping of Artificial Intelligence Applications (Intel Analytics Software, Oct 23, 2023). Use Intel AI Reference Kits and open-source software for fast prototyping.