Manuel Acosta — Senior Cloud & Data Engineer specializing in AWS data pipelines, serverless architectures, and cloud cost optimization.

I design and build cloud-native systems with a focus on data engineering, serverless infrastructure, and cloud cost reduction. AWS Certified Solutions Architect, Data Engineer, and Machine Learning Engineer with 9+ years of experience helping teams move fast without breaking things.

Let's Work Together!

I'm always open to exciting opportunities and collaborations. Whether it's consulting or project-based freelance work, feel free to reach out.

Contact Me

My Latest Projects

Serverless webpage

Selftaugth Cloud is a personal portfolio website engine capable of server-side logic using Laravel, fully deployed as a serverless application running on AWS for less than $1/month in recurring costs.

See project details

Embedded Amazon QuickSight Dashboards

Learn to embed Amazon QuickSight dashboards into your web application using a cloud-first, serverless AWS stack, giving you a flexible and powerful BI tool.

See project details

Blog Posts

Data Glossaries: The Semantic Layer That Decides Whether AI on Your Data Actually Works

May 2026

A follow-up to Data Catalog Core Concepts Explained.

In the previous post I argued that a weak glossary is the single biggest reason data catalogs become ghost towns. I want to take that claim seriously, because I keep watching teams nod at it, agree that glossaries matter, and then ship a catalog where the glossary is a folder of thirty terms named after database columns with the descriptions left blank.

There's a deeper reason this keeps happening. The data engineering profession learned how to model schemas, how to wire pipelines, how to write tests, how to draw lineage. It did not, as a discipline, learn how to model meaning. That work belongs to a different field entirely (information science), and most data teams have never been exposed to it.

This post tries to close that gap. It's a deep dive on data glossaries: what they actually are (and aren't), the three structural types you can build them as, what each one buys you, how to start without drowning, and what a realistic 12-month roadmap looks like. I'll keep referring back to the catalog post where the concepts connect.

If you're standing up a catalog in 2026 and you're serious about AI agents using it, glossary work is no longer optional. It's the layer the agents will lean on hardest, and the layer that's hardest to fake.

Data Catalog Core Concepts Explained — With an Honest Look at OpenMetadata

May 2026

There's a particular kind of pain that every data team eventually hits. A data scientist spends three days hunting for the "right" customer table. An analyst builds a dashboard on a column that was deprecated six months ago. A new hire asks where the revenue data lives, and four people give four different answers. Everyone has the data. Nobody can find it.

This is the problem data catalogs are built to solve, and in 2026, with AI agents now reading from the same warehouses humans do, solving it has gone from "nice to have" to "you cannot ship AI safely without it."

Amazon QuickSight: Implementing Row-Level Security

March 2025

Row-Level Security (RLS) in QuickSight ensures users only access data relevant to their roles, enhancing data confidentiality and compliance. It enables secure, multi-tenant analytics by restricting visibility at the row level. RLS is crucial for delivering personalized insights without compromising sensitive information.
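To make the idea concrete, here is a minimal sketch of row-level filtering in plain Python. It does not call QuickSight; the `RULES` mapping, the `visible_rows` helper, the user names, and the `region` column are all hypothetical, and the convention that an empty value means "no restriction on that column" is an assumption of this sketch.

```python
from typing import Dict, List, Optional

Row = Dict[str, str]

# Hypothetical permission rules: one entry per user.
# A value of None means "no restriction on that column" (assumption of this sketch).
RULES: Dict[str, Dict[str, Optional[str]]] = {
    "alice": {"region": "EMEA"},
    "bob": {"region": "AMER"},
    "admin": {"region": None},
}


def visible_rows(user: str, rows: List[Row]) -> List[Row]:
    """Return only the rows the given user is allowed to see."""
    rule = RULES.get(user)
    if rule is None:
        # Users without any rule see nothing in this sketch.
        return []
    return [
        row
        for row in rows
        if all(
            allowed is None or row.get(column) == allowed
            for column, allowed in rule.items()
        )
    ]


# Hypothetical multi-tenant sales data shared by every user.
SALES: List[Row] = [
    {"region": "EMEA", "amount": "120"},
    {"region": "AMER", "amount": "80"},
    {"region": "EMEA", "amount": "45"},
]

if __name__ == "__main__":
    for name in ("alice", "bob", "admin", "mallory"):
        print(name, visible_rows(name, SALES))
```

The same dataset serves every user, and the rule lookup decides which rows each one gets back, which is the property that makes multi-tenant dashboards possible on a single data source.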

About the Selftaugth Cloud page

February 2025

Starting today, I’m committing to sharing my cloud engineering journey every week on Self-Taught Cloud, documenting lessons, deep dives, and hands-on experiences with AWS, Kubernetes, data, and more.

Great news from AWS re:Invent 2024!

December 2024

AWS has recently introduced several enhancements across its services, aiming to improve data integration, scalability, and security. Here are ten announcements that caught my attention, focusing on integrations and services I already use.