Check my posts:

June 12, 2024 54 min read

June 12, 2024 • 54 min read

Linux eBPF : Understanding Its Kernel-Level Mechanics

My first contact with eBPF was reading about a networking tool for Kubernetes called Cilium. They advertised interesting performance improvements compared to similar tools, and it was all thanks to eBPF: a technology that allows us to run small programs inside the Linux Kernel. This short description gave me a high-level view that allowed me to explore, without really understanding how it worked (which is fine).

April 20, 2024 40 min read

April 20, 2024 • 40 min read

A Robust Environment for Building, Testing, and Developing the Linux Kernel

In my previous post, I talked about my studies on Kernel Development with a simpler Unix-like OS, called xv6. After finishing the MIT course 6.S081 Operating System Engineering (available online), I decided it was time to learn how to build the Linux Kernel. I joined a free software group (FLUSP) at the University of São Paulo (USP) and started taking a course on Free Software and Linux Kernel Development. In this post, I’ll detail the environment setup I utilized for creating and testing my first contributions to the Linux Kernel.

January 25, 2024 8 min read

January 25, 2024 • 8 min read

Compiling and Debugging a Kernel - Xv6 for the RISC-V Architecture

After years of working in high-level development, I decided it was time to delve deeper. I decided to learn more about how Operating Systems are built. This is my inaugural article on the topic, and it covers how to compile the necessary tools and run the XV6 kernel for a RISC-V machine/emulator.

May 7, 2023 12 min read

May 7, 2023 • 12 min read

Lightweight ETLs For Larger Than RAM Tables With Polars And ConnectorX

In my last blog post, I shared the latest tool that I have added to my arsenal - Apache Arrow (or the libraries based on it). In this post, I will delve deeper into the topic and demonstrate some of the techniques I employed.

February 14, 2023 8 min read

February 14, 2023 • 8 min read

How I Decreased ETL Cost by Leveraging the Apache Arrow Ecosystem

In the field of Data Engineering, the Apache Spark framework is one of the most known and powerful ways to extract and process data. It is well-trusted, and it is also very simple to use once you get the infrastructure set up. Understandably, most engineers will choose it for every task. However, in a lot of ways, it can be overkill. And a very expensive one.

August 1, 2019 1 min read

August 1, 2019 • 1 min read

Going Fast With Go - an introduction to Golang (slides)

This is a (small) presentation I gave to my coworkers at CEPESC. At the time, most of the software there was written in Java, and our team (researchers from the Federal University of Brasilia) was developing in Python. I was advocating using Golang in some performance-critical areas of our system.

August 31, 2018 6 min read

August 31, 2018 • 6 min read

Writing LaTeX Documents In Visual Studio Code With LaTeX Workshop

If you want to write LaTeX on your machine, VS Code is a great option for you! Installing all the necessary packages is a simple process. And with the power of Git, you can sync with web-based editors like Overleaf, and have satisfying versioning and backup.