Home | JJ's World

Setting up PostgreSQL for Python

Wed 30 December 2020

This tutorial describes how to set up PostgreSQL on Ubuntu and configure it to make it connect with Python.

Python postgres PostgreSQL SQLAlchemy Ubuntu

Scraping data based on xpaths

Thu 24 December 2020

A quick example on how to use xpaths to find values in a website based on certain elements. TLDR: "//tr[th//text()[contains(., 'Cost(s)')]]/td" will find the td where the th of the same row contains the text Costs(s).

Python scraping xpath requests

Getting the lead time for Jira tickets

Sat 29 August 2020

This is a simple example on how to retrieve the changelog of your Jira tickets using Python and requests and calculate the lead time.

ETL Python Jira data extraction Pandas

Hosting a static website with IPFS

Fri 10 July 2020

In this article I will upload a static website to IPFS to get myself familiar with the steps it takes to link an .eth domain to content hosted on the distributed web.

IPFS blockchain decentralized eth website hosting

Forward filling in Spark

Mon 25 May 2020

An exercise where I use Spark to forward fill a sparse dataset.

notebook Python Jupyter Spark pyspark data manipulation

GraphQL with Flask and MongoDB

Fri 15 May 2020

In this project I will crawl data from a website, upload it to MongoDB and make it accessible with GraphQL, all in Python!

Python data data-engineering MongoDB GraphQL graphene Flask

Integrating PySpark with Salesforce

Wed 22 April 2020

Another connection with PySpark that I needed for one of the projects at work. For our daily sync with Salesforce we use Python with simple-salesforce which makes it easy to pull data, but for Spark it takes a little more effort to get data out.

Python PySpark Salesforce dataframe Spar

Integrating PySpark with SQL server using JDBC

Wed 22 April 2020

In my series on connecting different sources to Spark I have explained how to connect to S3 and Redshift. To further extend my trials I show a quick demo on how to connect to a SQL server using JDBC.

Python PySpark SQL dataframe Spark

Using Powerlevel10K as Zsh theme

Sun 29 March 2020

Previously I was using powerlevel9k as theme for my iTerm2 Zsh configuration. Recently I had to install a new MacBook and found an easier way to make the terminal look fancier. powerlevel10k is the better version of powerlevel9k, especially since it has a configuration prompt where the installer guides you through all the changes you can make to the style.

For Mac it is as simple as the following few lines, assuming you have brew installed.

$ brew install romkatv/powerlevel10k/powerlevel10k
$ echo 'source /usr/local/opt/powerlevel10k/powerlevel10k.zsh-theme' >>! ~/.zshrc
$ p10k configure

zshell command line iterm2 styling

Using parametrize with PyTest

Fri 21 February 2020

Using parametrize writing tests becomes significantly easier. Instead of writing a test for each combination of parameters I can write one test with a list of different sets of parameters. A short example..

Python pytest testing parametrize