Blog: Data Science

Thu 27 Oct 2022

The purpose of our PAC is to build an infrastructure of data, models, forecasts, and reports that enables rapid comparison of models in terms of their predictive power based on a standardized assessment.

Wed 26 Jan 2022

This article describes how to write data from Python into an Excel file and format it. Thanks to automation, the content and appearance of the report only need to be programmed once; after that, the report can be recreated with minimal effort, for example for partial data sets or daily updated data.

Wed 15 Dec 2021

Software projects nowadays require much more than just writing code. Different programming languages, frameworks and architectures increase the complexity of projects. Docker provides applications with all their dependencies as packages in so-called "images" and thus simplifies workflows. This article serves as an introduction to the topic and gives you an overview of the basic concepts of Docker.

Thu 18 Nov 2021

In this second part we show an interesting way to combine existing Python packages and concepts to tackle some problems of the Python programming language. You will learn how to create a simple, yet flexible and powerful way to do complex DataFrame validation with Pydantic. With this approach, unit tests for functions that return DataFrames can be reduced, and the data quality of production pipelines can be ensured.

Tue 09 Nov 2021

In this article we discuss the downsides of Python's dynamic typing with regard to data quality and code maintainability. We give an introduction to the Pydantic package for input validation and show how decorators work.

Mon 04 Oct 2021

In our introductory article, we explained how discrete choice models can generate insights into customers' decision-making behaviour. In this article we show how to estimate an MNL model using RStan, the R interface of the statistical software Stan.

Thu 05 Aug 2021

This is the fourth part of our series about code performance in R. In the first part, I introduced methods to measure which part of a given code is slow. The second part lists general techniques to make R code faster. The third part deals with parallelization. In this part we are going to have a look at the challenges that come with large datasets.
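
The article's specific techniques aren't reproduced here, but as one common illustration of the problem space, the data.table package handles large datasets in R by reading files quickly and aggregating by reference instead of copying:

```r
# data.table avoids copies: grouped aggregation happens on the
# in-memory table itself
library(data.table)

dt <- data.table(group = sample(letters[1:3], 1e6, replace = TRUE),
                 value = rnorm(1e6))

# mean per group, computed without copying the full table
dt[, .(meanValue = mean(value)), by = group]
```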

Wed 30 Jun 2021

This is the third part of our series about code performance in R. In the first part, I introduced methods to measure which part of a given code is slow. The second part lists general techniques to make R code faster. In this part you will see how to take advantage of parallelization in R.
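
As a minimal sketch of the idea (the article covers more approaches), an lapply() call can be distributed over several worker processes with the base parallel package; a socket cluster like the one below also works on Windows:

```r
library(parallel)

slowSquare <- function(x) {
  Sys.sleep(0.1)  # stand-in for an expensive computation
  x^2
}

cl <- makeCluster(detectCores() - 1)    # start worker processes
res <- parLapply(cl, 1:20, slowSquare)  # distribute the work
stopCluster(cl)                         # always release the workers
```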

Thu 03 Jun 2021

Working in social media can be stressful, monotonous, and repetitive. In this blog post we show you how to take a screenshot of any website and post it via the Twitter API to boost your social media presence automatically.
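
The post's exact toolchain isn't shown here; as one possible combination in R (an assumption, not necessarily what the post uses), the webshot package can render a website to a PNG and the rtweet package can post it, given authenticated Twitter API access:

```r
library(webshot)  # needs PhantomJS, see webshot::install_phantomjs()
library(rtweet)   # needs Twitter API credentials

webshot("https://www.inwt-statistics.com", file = "screenshot.png")

# newer rtweet versions may additionally require media_alt_text
post_tweet(status = "Fresh from the blog!", media = "screenshot.png")
```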

Wed 05 May 2021

This is the second part of our series about code performance in R. It presents a number of approaches to reduce the time your code needs to run. It's useful to know these ideas before starting to write new code, but they also help to optimize existing code.
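
One idea from that toolbox, shown here as a small sketch: vectorized operations are usually far faster than an explicit loop over elements:

```r
x <- rnorm(1e6)

# loop version: one R-level iteration per element
system.time({
  res <- numeric(length(x))
  for (i in seq_along(x)) res[i] <- x[i]^2
})

# vectorized version: same result, a fraction of the time
system.time(res <- x^2)
```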

Mon 26 Apr 2021

Let's assume you have written some code, it's working, and it computes the results you need, but it is really slow. If you don't want to get slowed down in your work, you have no choice but to improve the code's performance. But how to start? The best approach is to first find out where optimization pays off. How to do that is the subject of this article.
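
As a minimal sketch of such a measurement, base R ships a sampling profiler: Rprof() records where the time is spent and summaryRprof() aggregates the result. The matrix inversion below is just a stand-in for a candidate bottleneck:

```r
f <- function(n) {
  m <- matrix(rnorm(n * n), n)
  solve(m)  # candidate bottleneck
}

Rprof("profile.out")  # start sampling
invisible(f(1000))
Rprof(NULL)           # stop sampling
summaryRprof("profile.out")$by.self
```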

Mon 29 Mar 2021

In this article, we would like to discuss discrete choice models used in marketing analysis and modeling. Discrete choice models allow us to develop a better understanding of the decision-making behavior of customers. This understanding can be used, among other things, to make precise predictions about purchase decisions, to evaluate how customers perceive promotional offers, product messages or brand strategies, and to assess new products or improved product features.

Mon 15 Mar 2021

There are numerous reports in the media on the intensity and spatial distribution of new corona infections, all of which are based on a single incidence number per district. If one instead assumes a smooth distribution of the risk of infection across the whole of Germany, one obtains a district-independent smooth function that describes the number of new infections occurring at any given place. This representation allows the identification of local hotspots and their development over time, and thus provides valuable information on the risk of spread of the corona pandemic.

Sat 20 Feb 2021

The map shows the local 7-day incidence rate of officially reported Covid-19 infections in Germany over time. The project originated from a master's thesis in the joint master's programme in Statistics, in cooperation with the Freie Universität Berlin and INWT Statistics. An advanced algorithm is used to map nationwide infection cases, revealing more visible patterns and providing higher accuracy compared to district-level incidence data.

Wed 13 Jan 2021

The 2020 US presidential election last November was the prime political event of the past year. Political scientists and data scientists had the opportunity to develop forecasting tools to understand and predict American voter behavior. Reflecting on the different forecasting methodologies, there are 10 takeaways for every data scientist to keep in mind.

Thu 10 Dec 2020

Data-driven approaches to maximizing customer relationships are more important than ever in today’s highly saturated and competitive markets. There are many steps a proactive business can take to positively position themselves to keep high-value customers and prevent customer churn.

Mon 30 Nov 2020

Now that Halloween is over and Advent is just around the corner, it is time for some Christmas decorations. And what better way to get into the holiday spirit than with a Python 🐍 project?

Tue 20 Oct 2020

How to protect your data from one of the most common (and potentially damaging) web security risks. 

Wed 16 Sep 2020

Traditionally, marketing decisions have been made by executives on the basis of instinct, experience, and what data are available. But what if this could be automated, with an artificial agent making use of huge amounts of data to automatically determine the optimal marketing strategy for every customer individually at a particular moment in time? This is precisely the promise of reinforcement learning.  

Mon 15 Jun 2020

Jenkins is currently the leading open source automation server. It is programmed in Java and distributed under the MIT license. Jenkins is free and very flexible, as it supports a wide range of version control systems and offers more than 1,500 plugins. In this blog article we introduce the CI tool Jenkins and the essential aspects of its user interface.

Mon 15 Jun 2020

This article provides a theoretical introduction to Continuous Integration and an overview of the pros and cons of using CI Tools. A selection of different tools for getting started will be presented.

Tue 31 Mar 2020

Missing or incomplete data can have a huge negative impact on any data science project. In this blog we explore what kinds of missing data exist, and how we can go about overcoming the challenges they present. 

Thu 13 Feb 2020

In this post we’d like to introduce you to our new R package shinyMatrix. It provides you with an editable matrix input field for shiny apps.
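
A minimal app sketch, assuming the matrixInput() interface from the package documentation (arguments beyond inputId and value may differ between versions):

```r
library(shiny)
library(shinyMatrix)

ui <- fluidPage(
  matrixInput("mat", value = diag(3), class = "numeric"),
  verbatimTextOutput("out")
)

server <- function(input, output, session) {
  output$out <- renderPrint(input$mat)  # reacts to edits in the matrix
}

shinyApp(ui, server)
```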

Fri 27 Dec 2019

Business is changing as a result of the increasing quantity and variety of data available. Significant new opportunities can be realized by harnessing the knowledge contained in these data - if you know where to look. A data science team can help to bring raw data through the analysis process and derive insights that are critical in today’s technologically-competitive environment.

Mon 23 Dec 2019

Visualization tools in R and Python support projects in different ways. If you are still unsure which language is right for you, this article may be of interest and help you decide. Popular packages from both languages are presented and sample graphics are created.

Tue 19 Nov 2019

When you write code, you’re sure to run into problems from time to time. Here are some advanced tips and tricks for handling these errors, explained accessibly.

Mon 21 Oct 2019

One of the biggest challenges that companies face is to use their advertising budgets efficiently, and to advertise purposefully such that advertising meets the customer when it has the most leverage - without being overwhelming, repetitive, or irrelevant. With Marketing Mix Modeling, we can help to overcome this challenge.

Thu 26 Sep 2019

Multi-Armed Bandit algorithms are a modern alternative to traditional A/B testing. Similar to Reinforcement Learning, these algorithms can optimize what is shown to the client to maximize rewards while simultaneously determining the most successful option for your business. 
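
As an illustration of the principle (not the article's specific algorithm), a minimal epsilon-greedy bandit explores a random variant with small probability and otherwise exploits the best observed click rate:

```r
set.seed(1)
trueRates <- c(A = 0.05, B = 0.08)  # unknown in practice
eps <- 0.1
shown <- clicks <- c(A = 0, B = 0)

for (i in 1:10000) {
  arm <- if (runif(1) < eps || any(shown == 0)) {
    sample(names(trueRates), 1)       # explore
  } else {
    names(which.max(clicks / shown))  # exploit
  }
  shown[arm]  <- shown[arm] + 1
  clicks[arm] <- clicks[arm] + rbinom(1, 1, trueRates[arm])
}

clicks / shown  # estimates converge while most traffic goes to B
```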

Tue 17 Sep 2019

Having understandable, clean, and compliant data is a necessity for business success. Particular care is needed to ensure that analyses made on the basis of data are reliable and offer value to an organization. In this context, the role of a Data Steward is becoming ever more valuable. This article discusses the roles and tasks of Data Stewardship.

Mon 09 Sep 2019

This article describes best-practice approaches for developing shiny dashboards. Creating the dashboard in package form and covering it with unit tests enable the development of robust solutions and guarantee high quality.

Thu 25 Jul 2019

An introduction to and comparison of the market leaders in statistics programs - R, Python, SAS, SPSS, and STATA - to help pick the best one for your needs.

Tue 16 Jul 2019

In this article we look at how to build a shiny app with clear code and reusable, automatically tested modules. To that end, we first go into the package structure and testing of a shiny app before we focus on the actual modules.
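
A minimal module sketch, written with the current moduleServer() API (the article predates it and may use the older callModule(); the counter example itself is illustrative):

```r
library(shiny)

counterUI <- function(id) {
  ns <- NS(id)  # namespaces the inputs so the module can be reused
  tagList(actionButton(ns("plus"), "+1"), textOutput(ns("value")))
}

counterServer <- function(id) {
  moduleServer(id, function(input, output, session) {
    output$value <- renderText(input$plus)  # button click count
  })
}

ui <- fluidPage(counterUI("a"), counterUI("b"))
server <- function(input, output, session) {
  counterServer("a")
  counterServer("b")
}
shinyApp(ui, server)
```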

Wed 19 Jun 2019

In current online marketing practice, short-term TV-induced web page traffic is usually quantified by a simple baseline correction. In our blog article, we show which measurement errors this entails, how they can be avoided, and how the identified TV impact is correctly accounted for in attribution.

Tue 21 May 2019

In this article we present our R package rsync, which serves as an interface between R and the popular Linux command line tool rsync. Rsync allows users of Unix systems to synchronize local and remote files between two locations.

Tue 07 May 2019

When a code base grows, we may first think of splitting it into several files and sourcing them. Functions, of course, are rightfully advocated to new R users and are the essential building block. Packages are, in turn, the next level of abstraction R has to offer. With the modules package I want to provide something in between.
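
A minimal sketch of the idea, following the examples in the package documentation: a module bundles functions with explicit imports, without requiring a full package:

```r
library(modules)

m <- module({
  import("stats", "median")  # explicit dependency instead of relying
                             # on the global search path
  center <- function(x) median(x, na.rm = TRUE)
})

m$center(c(1, 2, 3, NA))  # functions are accessed via the module object
```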

Mon 25 Mar 2019

ggCorpIdent is a package for customizing ggplot2 graphics in R easily and without touching the plot code itself. It lets you use custom colors in the plot, which are interpolated if you have not specified as many colors as needed. You can add custom fonts for the text elements within the plot and embed your corporate logo.

Wed 30 Jan 2019

In this post I'd like to introduce the R Markdown template for business reports by INWTlab. It's a nice and clean template for use in a corporate environment that is easy to customize in terms of colors, cover, and logo.

Wed 21 Nov 2018

In the first part of this blog series, we examined the theoretical foundations of cluster analysis. Now we put the theory into practice using R and find a cluster solution for the mtcars data set. Then the cluster solution is evaluated and interpreted.
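
A minimal version of such a cluster solution with base R's kmeans (the article's actual method and number of clusters may differ):

```r
scaled <- scale(mtcars)  # scale first so no variable dominates
set.seed(42)
km <- kmeans(scaled, centers = 3, nstart = 25)

km$size                              # cluster sizes
split(rownames(mtcars), km$cluster)  # car models per cluster
```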

Tue 06 Nov 2018

This article focuses on introducing the theoretical concepts of cluster analysis. You'll get a basic understanding of the underlying measures and the different methods that can be used for clustering. An evaluation method for group structures and cluster solutions is introduced towards the end of the article.

Thu 11 Oct 2018

This article describes how you can apply a programming technique called memoization to speed up your R code and solve performance bottlenecks.
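
The idea in a nutshell, as a small closure-based sketch (the memoise package offers the same out of the box):

```r
memoize <- function(f) {
  cache <- new.env()
  function(x) {
    key <- as.character(x)
    if (!exists(key, envir = cache, inherits = FALSE)) {
      assign(key, f(x), envir = cache)  # compute once, then cache
    }
    get(key, envir = cache)
  }
}

slowSqrt <- function(x) { Sys.sleep(1); sqrt(x) }
fastSqrt <- memoize(slowSqrt)

system.time(fastSqrt(9))  # ~1 second: computed
system.time(fastSqrt(9))  # ~0 seconds: served from the cache
```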

Tue 25 Sep 2018

The Kernelheaping package also supports boundary-corrected kernel density estimation, which allows us to exclude certain areas where we know that the density must be zero. One example is estimating a population density while excluding uninhabited areas such as lakes, forests, or parks. The Kernelheaping package employs a boundary correction method in which each single kernel is restricted to the area of interest.

Mon 06 Aug 2018

The speed or run-time of models in R can be a critical factor, especially considering the size and complexity of modern datasets. The number of data points as well as the number of features can easily be in the millions. Even relatively trivial modeling procedures can consume a lot of time, which is critical both for the optimization and the updating of models. An easy way to speed up computations is to use an optimized BLAS (Basic Linear Algebra Subprograms): R's default BLAS is well regarded for its stability and portability, not necessarily its speed, so there is real potential here.
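
A quick way to see the effect on your own machine: check which BLAS R is linked against, then time a typical linear-algebra workload before and after switching to, say, OpenBLAS:

```r
sessionInfo()  # shows the BLAS/LAPACK libraries in use (R >= 3.4)

n <- 2000
A <- matrix(rnorm(n * n), n)
B <- matrix(rnorm(n * n), n)

system.time(A %*% B)  # rerun after switching BLAS and compare
```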

Fri 13 Jul 2018

Interval censoring can be generalised to rectangles or alternatively even arbitrary shapes. That may include counties, zip codes, electoral districts or administrative districts. Standard area-level mapping methods such as choropleth maps suffer from very different area sizes or odd area shapes which can greatly distort the visual impression. The Kernelheaping package provides a way to convert these area-level data to a smooth point estimate.

Mon 28 May 2018

All over the world, at the newsstand, in public transport and above all in countless betting communities, football fans are currently asking themselves: Who will become World Champion at the 2018 Football World Cup? Using statistical data science models, we simulated the 2018 FIFA World Cup 10,000 times to determine the probabilities for the next World Cup winner and thus the World Cup favourites. Throughout the tournament, you will find the answer to the question of who the top favourites are here in our blog, updated daily and based on a lot of data and up-to-date statistical analyses.

Wed 04 Apr 2018

This article is a reflection on how I use different strategies to solve things in R. Design pattern may seem like a big term, especially because of its use in object-oriented programming. But in the end I think it is simply the correct label for recurring strategies in designing software.

Mon 05 Mar 2018

The motivation for this plot is the function graphics::smoothScatter, which is basically a plot of a two-dimensional density estimate. In the following I want to reproduce its features with ggplot2.
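
A minimal ggplot2 analogue in current ggplot2 syntax (not necessarily the code from the post): a raster of the 2D kernel density estimate with a smoothScatter-like palette:

```r
library(ggplot2)

d <- data.frame(x = rnorm(1e4), y = rnorm(1e4))

ggplot(d, aes(x, y)) +
  stat_density_2d(geom = "raster",
                  aes(fill = after_stat(density)),
                  contour = FALSE) +
  # blues9 is the palette smoothScatter uses by default
  scale_fill_gradientn(colours = colorRampPalette(blues9)(256))
```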

Tue 06 Feb 2018

In this blog article I'd like to introduce the univariate kernel density estimation for heaped (i.e. rounded or interval censored) data with the Kernelheaping package.
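
A minimal sketch, assuming the dheaping() interface from the CRAN documentation (argument names may differ between package versions):

```r
library(Kernelheaping)

x <- rnorm(500, mean = 100, sd = 15)
xheaped <- round(x, -1)  # values heaped at multiples of 10

est <- dheaping(xheaped, rounds = 10)  # estimate the unheaped density
plot(est)
```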

Thu 25 Jan 2018

Sticking to a styleguide helps you write cleaner code and makes working in a team more comfortable. In this article, we present the styleguide we use at INWT, and how you can check your code for deviations from certain style rules.
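
Checking for deviations can be automated, for example with the lintr package (our styleguide's specific rules are configured separately; this sketch uses the default linters):

```r
library(lintr)

writeLines("add=function(x,y){x+y}", "example.R")
lint("example.R")  # reports e.g. missing spacing around '=' and ','
```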

Tue 12 Dec 2017

This is a reproduction of the (simple) bar plot of chapter 6.1.1 in Datendesign mit R with ggplot2.

Wed 22 Nov 2017

Which layout of an advertisement leads to more clicks? Would a different color or position of the purchase button lead to a higher conversion rate? Does a special offer really attract more customers – and which of two phrasings would be better? For a long time, people have trusted their gut feeling to answer these questions. Today all these questions could be answered by conducting an A/B test.
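
Once such a test has run, a classic evaluation is a two-sample proportion test; a small example with made-up numbers:

```r
conversions <- c(A = 120, B = 150)   # e.g. clicks on the button
visitors    <- c(A = 1000, B = 1000)

# tests whether the conversion rates of A and B differ significantly
prop.test(conversions, visitors)
```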

Wed 01 Nov 2017

As I began to (re)discover the usefulness of closures, I ran into some behaviour that seemed very strange at first sight. It is in fact consistent with the scoping rules of R, but it took a while until I had internalized those rules to the point where it felt consistent to me, too.
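
The classic example of this behaviour: a closure looks up free variables when it is called, not when it is created:

```r
adders <- list()
for (n in 1:3) {
  adders[[n]] <- function(x) x + n
}
adders[[1]](10)  # 13, not 11: all three closures see the final n

# a function factory gives each closure its own n
makeAdder <- function(n) {
  force(n)  # evaluate n now instead of lazily
  function(x) x + n
}
adders <- lapply(1:3, makeAdder)
adders[[1]](10)  # 11, as expected
```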

Wed 16 Aug 2017

This last part is about visualizing the crash location and the flight route with the help of the R package leaflet.
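
The core pattern looks like this (the coordinates are illustrative, not taken from the crash data):

```r
library(leaflet)

leaflet() %>%
  addTiles() %>%  # add a base-map layer
  addMarkers(lng = 13.4050, lat = 52.5200,
             popup = "Example location")
```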

Wed 16 Aug 2017

In this part I'll request the geocoordinates for the crash location, the point of departure, and the intended arrival location from the Google Maps Geocoding API.

Tue 08 Aug 2017

Have you ever tried to find your way around in the file structure of an already existing project? To separate relevant from obsolete files in a historically grown directory? To find out in which order existing scripts should be executed? To make all this easier, it helps to have a consistent file and folder structure across your projects. In this article we present our file structure for R projects to help you get started. 

Tue 01 Aug 2017

This first part is about how to scrape information on aviation accidents from planecrashinfo.com. On this site you can find tables nested inside tables with lots of information on aviation accidents of the last century.
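
The basic pattern, sketched with rvest (the URL follows the site's apparent year-page scheme and may need adjusting):

```r
library(rvest)

page   <- read_html("http://www.planecrashinfo.com/2017/2017.htm")
tables <- page %>%
  html_nodes("table") %>%      # all tables, including nested ones
  html_table(fill = TRUE)      # as a list of data frames
str(tables[[1]])
```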

Mon 20 Mar 2017

This article presents the election forecast of INWT for the 2017 elections to the Bundestag. A statistical forecasting model based on the survey results of leading German survey institutes is presented. Unlike the survey institutes we can also use our election forecast to predict the probability of possible coalitions after the election.

Tue 07 Mar 2017

MariaDB is currently the fastest growing open source database solution. It is mainly developed by the MariaDB Corporation and is a fork of MySQL. This article describes our own solution for monitoring and optimizing our internal database infrastructure, implemented with R and Shiny: the MariaDB monitor. It is an open source alternative to existing fee-based or inflexible monitoring tools.

Wed 15 Feb 2017

A statistical analysis of more than 150 Lego building kits shows that the price of individual Lego components is determined not only by their size, but also by the Lego theme, such as Star Wars.