Webdeveloper | Data Scientist

I am Martin. I am a techie. And they call me smart.

Work &

Education

2020

See what my approach of analysing a WhatsApp chat with 2 years of messages looks like. Using Python, Pandas and Matplotlib to generate statistics of a very active chat group. Read more »

Data Science | WhatsApp | Python

2020

The process of development and creating an easy-to-use personal finance application isn't easy. Started many years ago as an Excel sheet, now using vue.js and Google Firebase for a prototype. Read more »

Webdevelopment | Finance App | Single Page Application

2016 - 2020

Creating modern and engaging web-based trainings for international clients. A self developed framework helped us for efficient automatisation and a good benefit-cost ratio.

Webdevelopment | E-Learning | Berlin

2019

Creating a smart notification solution for a game called Civilization VI. Using webhooks and a simple PHP script, offers a nice and automated push-notification solution. Read more »

Webdevelopment | Push Notifications | PHP

2010 - 2014

Studying the basics of theoretical computer science and learning on a practical level. Multimedia, webdevelopment, designing information systems and as well as virtual reality were part of it.

Bachelor of Science | Media and Computer Science | Leipzig

Skills

Webdevelopment
Data Science
Tools

Mindset

Commitment. Communication. Creativity. Goal Driven. Form Follows Function. Innovation. Just Do It. Problem Solving. Transparency.

Personality

Balanced. Cooperative. Disciplined.Empathetic. Forgiving. Friendly. Honest. Humorous. Optimistic. Passionate. Social.

Easter Eggs

Berlin. 29. Movies. Music. Games. VR. Travel. Photography. Bike. Bars. Food. Learn. Animals. Dream. Germany. Europe.

Custom made with ❤ by smartin

Civilization Webhook Script

The Reason

Civilization VI is a round based strategy game, which I play sometimes with a couple of friends. It offers the possibility to play a game asynchronously via cloud.

Screenshot of the game

The Problem

We use a WhatsApp group to tell the other players to take their turn, but it got really annoying after a couple of messages. So I thought about some optimization.

The Solution

Fortunately, the game offers a webhook option. Everytime a player finishes his turn, a request is made to the inserted URL. First, I used an e-mail service (IFTTT) to send everybody an email. To my own surprise, there was a huge delay between finishing the round and receiving the email. So I wrote a simple PHP script which is called via the POST request from the game. The retrieved JSON is written to a text file and a website displays who’s next. Now, everybody was able to add a bookmark on the start screen of the phone and visit the site to see who is next.

screenshot Simple website with the recieved data

The Smart Solution

But I just wasn’t happy enough. Living in the 20th century, there has to be a better way. I decided to use the Google Firebase Cloud Messaging to send a notification to our smartphones, everytime a player finished their round. For that I wrote a script, which calls the Google FCM HTTP Server with the notification tokens of the specific devices to send the messages. And all it takes now, is to allow notifications for the website. No unnecessary chatting and website visiting.

screenshot Push notification via Chrome

Personal Finance (Demo)-App

What is it about?

I always took a loose track of my daily expenses, e.g. Food and Mobility. At first I just wanted to see how much money I have left at the end of the month and where I can tweak my behaviour or save a little bit of money. But I also started to think of making use of it. The lack of a good and easy-to-use app gave me the idea to start developing my own “personal finance” app.

Example screenshot of the app

Disclaimer

It is still an early build which isn’t finished. Right now it is just a prototype and playground in UI/UX design, coding and trying out new technologies. Please continue reading to see what I plan and how it came that way.

At the Beginning there was Excel

There was a time, with the lack of internet and probably a big portion of boredom, I started tracking my money and making some “fancy” Excel sheets. It was really tedious to enter my data into an Excel sheet all the time and when I was on the road, I needed to remember them until I had access to my Excel sheet. After a year of taking daily notes of my income and expense, and never being really happy about Excel as a tool, I started thinking of developing a simple website for me.

screenshot Simple website with the recieved data

First private Test Version

In my early versions I simply used PHP and SQL for the backend, and plain HTML/CSS/JavaScript with some libraries like d3.js for displaying the data. It was quite fun to try different ways of implementing the same functionality. I wanted to use node.js for the backend, but my domain contract only allowed PHP7+. Using a relational database was comfortable, but not challenging because I used it multiple times before.

And I wasn’t happy either, using “older” technologies like jQuery or PHP and no user-friendly UI/UX. On my daily work I started using vue.js and the Google Cloud Services, so I thought of a complete rework and step up the game.

The "Final" Version

So for the first time I did some mockups and realized what I really wanted to implement:

A Login / Signup Page and user accounts for the backend
A simple and “less clicks as possible” page for entering the data
A page with some nice looking charts for displaying the data visually
Using a NoSQL database with Google Firebase (Even though NoSQL isn’t the best use-case here)
Using vue.js as a single-page-application framework
Making it “Progressive Web App”-ready

This was the simplest application I came up with, which is doable in a short amount of time. Nonetheless, I needed nearly a month to develop a bug free version, but using vue.js and Google Firebase as Cloud Service still saved a lot of time. Vue is great and my favourite frontend framework out there and Google offers nice features which takes applications on track fast. As I already said, I underestimated the development time for that simple version, but having a solid base to rely on where I can add things later on really easily is a real benefit. Below you see the component chart.

screenshot Green and grey boxes are in general simple components. Yellow highlights the two main pages. Green boxes are the “view”-components and grey are reusable components which display nothing but have an important functionality. Red are external modules / libraries and blue is my external helper JavaScript file. Inside the components are the method names and used variables, so I can remember them better.

screenshot And here is the “final” design of the two main pages

The Future Version

When I find the time to work on it again, the first thing will be a reworked and unified UI. It is still a little too much and sometimes not clear enough where to navigate and enter your values. E.g. the charts are kind of nice, but not necessary and show no real value at the moment. Refactoring the code is also necessary. Adding an introduction screen, notifications when you forgot to enter something, and making the app learn and predict user inputs is on the To-Do list. Scanning bills and barcodes so that you don’t need to enter anything is a dream which will come true in #2030.

WhatsApp Chat Analysis

The Reason

I have an increased interest in analysing data and what information you can extract from it. While it was part of my further education in early 2020 I was in need of a side project. (While my further education of the matter in early 2020 I was in need of a side project.) So I came up with the idea of analysing a very active WhatsApp group of mine. After two years of chatting, there is enough data so I can refer to it easily and make it the best use-case to learn new things.

Disclaimer

I know that “hacking into WhatsApp” and analysing chat messages might not be the most legal thing, but as a private project and nobody else having access to the data, this should be fine.

The Problem: Getting the Data

Before I was even able to start, the biggest issue was to get the data in first hand. In Germany WhatsApp doesn’t allow any way of exporting the messages anymore. So I asked a friend if he has a backup of the chat. Fortunately he exported a text file while that was still possible. Unfortunately, it is only one of the two years. So I read that there is no other way than decrypting the local message.db file on my phone to get the chat messages stored inside the database file. For that I needed the database key from the phone, but extracting that key requires a rooted phone though. I didn’t want to root my current phone, so I used my old HTC One M7 to give it a try. WhatsApp uses one database key per phone number so I was fine with switching devices.

Because I have never done that before I always thought it is a difficult thing to pull off, but I successfully rooted my phone in a few hours. Then I changed my WhatsApp account temporarily to my old phone, grabbed the database key, copied all the files to my PC and decrypted the database file with it.

The Struggle: Clean up the Data

I was happy that I have all the messages of the years. Unluckily for me, I had two different sources of data. One text file from my friend of the first year and a JSON-based file from the database for another year. And the way the messages and information inside the text file are stored didn’t match with the JSON file.

screenshot The text file without any database-like format The original extracted JSON file from WhatsApp

So I wrote a Python script which converts the text file into a JSON file, which in turn matches the style and information of the one I extracted via the message.db. There was a lot of regex, working with strings, try and fail with timestamps (puh, annoying) and encoding/decoding “emoji-characters”.

The Achievement: The First Plots and a Fun Fact

After merging both JSON files to a single file I started analyzing my data. Therefore I used the pandas library for manipulating the data via so called dataframes and the matplotlib library for displaying the diagrams.

screenshot Cumulated messages per minute over 2 years

This is a representation of the cumulated messages per minute over the last two years. It doesn’t represent the actual messages per minute, e.g. with an average. With the amount of data it is easily to see some behaviour:

The most messages were between 8am and 19pm.
There are uprising spikes in the orange curve between 5am and 8am, which I would assume is the time somebody new wakes up and responds to an earlier message.
In the density lines in the background, you can see a gap somewhat after 9am, which I interpret is the time when you grab a coffee or have a daily meeting in the office.
There is also a spike in the evening, after that the activity decreases rapidly.
Summary: Over the last two years there were around 40 messages a minute in the main time of the day.

This one shows the average message count, either per 30 seconds (blue dots) or an average (orange line). And this one corresponds nicely with the cumulated chart before. Analysing the information of the blue dots, gives you the fact that there are way more “smaller messages” per 30 seconds. Why there is a gap between 2 and 2 ½ I can only guess. Maybe if there is already more activity, others join more frequently?

screenshot Average amount of messages per 30 seconds

But why 30 seconds as a time interval? Well, we (the humans) tend to check our messages frequently and respond to them quickly and more often. If you increase the timespan, the information that there are indeed more messages would be lost. In the end it is still the same result on average, but you lose some important information. See below.

screenshot Average amount of messages per 1 Minute

Fun Fact: Now I wanted to know how long it would take to read through the complete chat. After some research I found that 10 words are approximately 50 characters, and reading 500 words takes around 228 seconds. Which means it takes 0,456s reading one word.

So I calculated the average length of a message in our chat, which is 50 characters or 10 words. Due to the fact of messages with only emojis or skippable content, I estimated an average reading time of three seconds per message.

The whole chat contains 90154 characters, which are around 18.030 words and 54090 seconds or 15h to read through the whole chat.

The Future: Jupyter Notebook and more Analysis

As this process is quite time consuming and just a side project, I haven’t got any further. For the future I will make an interactive chart with Jupyter Notebook using the UI widgets. I also plan on doing a language analysis after I have done the meta and numerical work.

screenshot All the messages per day with the light blue line and two average lines for testing purposes. The 2 vertical orange lines are simply the yearly anniversary.

I like you too.