Blogging data since 1886

Author: Paavo Pohndorff Page 1 of 4

A Data Science consultant working at Sopra Steria. He occasionally blogs about data and related topics here and is the host of the Dortmund Data Science Meetup.

Dover Harbour

Bring Data Science Activities to the Cloud

This document is a primer towards creating an environment in which you can create and deploy your Data Science projects.

Defining Data Science Using the Common Crawl Web Corpus – 1

Recently I finally finished one of my major projects (my thesis). So now I have some spare time for doing smaller projects after work. Like this one:

SHORT – Dealing with embedded nul in string manipulation with R

The past hours I’ve been ramming my head into the same problem over and over. I had to deal with multiple strings of hexadecimal values coming from multiple sources. So far so easy, just use the iconv package… no, does not work at all for specific strings.

Particulates – Getting Hands Dirty with R and Leaflet

So I’ve been away very often and when I was home I’ve been pretty busy with work for the last few weeks. In the last few days I finally had some rest, especially after the last week in Czech Republic, where I did part of my data exploration for a recent Data Science project on Fraud Detection in SMS (project is still work in progress). But let’s get back to the topic and talk about R and Leaflet and mapping Open Data.

SHORT – Big Data vs Data Science vs Internet of Things

Big buzzwords have always been, will always be and nobody might fully understand them because everyone has a different perspective and scope on it.

Page 1 of 4

Powered by WordPress & Theme by Anders Norén