JJ's World

Creating a simple REST API with FastAPI and SQLAlchemy

Sat 27 February 2021

The fourth tutorial in the Postgres related articles is an improvement on the Flask API I made last time. I will be using FastAPI to create the same API, but with the batteries included.

Read more →
Python postgres PostgreSQL SQLAlchemy FastAPI REST API

Creating a simple REST API with Flask and SQLAlchemy

Tue 19 January 2021

As a third tutorial in my Postgres related articles I am creating a basic REST API to retrieve data through Flask and SQLAlchemy from the Postgres database.

Read more →
Python postgres PostgreSQL SQLAlchemy Flask REST API

Scraping data with Scrapy and PostgreSQL

Tue 05 January 2021

To continue the experiment with PostgreSQL I made a simple crawler that retrieves data with Scrapy and stores the data in the PostgreSQL database.

Read more →
Python postgres PostgreSQL SQLAlchemy Scrapy web crawling

Setting up PostgreSQL for Python

Wed 30 December 2020

This tutorial describes how to set up PostgreSQL on Ubuntu and configure it to make it connect with Python.

Read more →
Python postgres PostgreSQL SQLAlchemy Ubuntu

Scraping data based on xpaths

Thu 24 December 2020

A quick example on how to use xpaths to find values in a website based on certain elements. TLDR: "//tr[th//text()[contains(., 'Cost(s)')]]/td" will find the td where the th of the same row contains the text Costs(s).

Read more →
Python scraping xpath requests

Getting the lead time for Jira tickets

Sat 29 August 2020

This is a simple example on how to retrieve the changelog of your Jira tickets using Python and requests and calculate the lead time.

Read more →
ETL Python Jira data extraction Pandas

Hosting a static website with IPFS

Fri 10 July 2020

In this article I will upload a static website to IPFS to get myself familiar with the steps it takes to link an .eth domain to content hosted on the distributed web.

Read more →
IPFS blockchain decentralized eth website hosting

Forward filling in Spark

Mon 25 May 2020

An exercise where I use Spark to forward fill a sparse dataset.

Read more →
notebook Python Jupyter Spark pyspark data manipulation

GraphQL with Flask and MongoDB

Fri 15 May 2020

In this project I will crawl data from a website, upload it to MongoDB and make it accessible with GraphQL, all in Python!

Read more →
Python data data-engineering MongoDB GraphQL graphene Flask

Integrating PySpark with Salesforce

Wed 22 April 2020

Another connection with PySpark that I needed for one of the projects at work. For our daily sync with Salesforce we use Python with simple-salesforce which makes it easy to pull data, but for Spark it takes a little more effort to get data out.

Read more →
Python PySpark Salesforce dataframe Spar
← Older
Newer →

I am a lead data engineer with over 15 years of working with data. I have a passion for the field of machine learning, pattern recognition, big data, blockchain and ubiquitous computing.

While I mainly work in Python, I try to experiment with different languages and frameworks when I can. Lately I have been experimenting with AWS and Terraform since apart from data skills I want to stay on top of new developments within DevOps.

I am using this page as a portfolio and showcase, cheatsheet but mainly a historical record. That is why you will mainly find shell commands, short scripts or notebooks just for myself to not reinvent the wheel.


  • dataframe
  • postgres
  • API
  • Docker
  • PySpark
  • AWS
  • Spark
  • Flask
  • VueJS
  • notebook
  • Jupyter
  • Python
  • Ethereum
  • MongoDB
  • DevOps
  • Pandas
  • shell
  • S3
  • data engineer
  • Ubuntu
  • javascript
  • testing
  • blockchain

© JJ's World | Powered by Pelican | Hosted on Cloudflare Pages | 2008 - 2022