Case study

Migration of data to a new Data Lake environment

Industry
Manufacturing
Cooperation period
2018 – present
Czarno-białe zbliżenie rzędu zaparkowanych samochodów, skupienie na przedniej części eleganckiego auta z wyraźnie widocznym reflektorem i felgą, rozmyte tło kolejnych pojazdów.

About client

Volkswagen Group Polska has been the leader of the new car market in Poland since its inception. Being an importer of seven brands: Volkswagen, Volkswagen Commercial Vehicles, SKODA, SEAT, CUPRA, Audi and Porsche, the company recorded a share of 25% in the Polish passenger car market in 2021.

The Group is heavily involved in the development of electric mobility: it offers the latest generation of electric cars, creates public awareness in this area and supports the development of charging infrastructure in Poland. At the same time, the company is active in the distribution of spare parts and accessories.

A global leader in the automotive industry, combining innovation, reliability, and timeless design.

About project

The aim of the project was to build a Data Lake environment based on the Cloudera 6 technology stack, along with subsequent maintenance and implementation of changes.

“We started our cooperation with Volkswagen Group Polska by having a joint conversation about the Client’s needs. We listened to the requirements and presented possible options for data migration to the new Data Lake environment. Our priority was to be as flexible as possible both about the Client’s needs and processes.”

Tomasz Mirowski

Chief Technology Officer w 3Soft

Solution

The design work was divided into successive stages, which enabled an orderly and effective implementation of the changes.

The first step was to migrate the existing environment to a new architecture designed for this purpose, built on the Cloudera 6 technology stack. The next task was to build and maintain data flows to the target, already existing Data Platform.

Implementation and development

The full implementation of the project at the Client – launching the Platform based on a new technology stack and preparing the initial data loading processes – took 6 months.

We began our cooperation with Volkswagen Group Polska by analyzing source data and file formats to create optimal data transformation processes. Our experts acted with full flexibility and in accordance with the Client’s internal processes.

Then, based on Apache NiFi and Spark components, we prepared data flows from various sources, e.g.: CSV, XML, JSON flat files, Excel formats and Orcale and MSSQL databases, to the existing Data Platform.

At the current stage, we provide support for the architecture built on the Cloudera 6 technology stack in accordance with the adopted SLA and implement the necessary changes reported by the Client.

“We were looking for a company experienced in building Data Lake environments. Already during the initial talks, 3Soft’s representatives offered possible solutions and recommendations for the migration, and then implemented the project as agreed. It was very important to us that the experts from 3Soft approached our processes in a fully flexible manner, and the involvement of people on our side in the migration process was minimal – it mainly came down to making key decisions and clarifying any doubts.”

Michał Lazarowicz,

P.O. Kierownika ds. Systemów Raportowych i BI w Volkswagen Group Polska

Migration of data in Volkswagen Group Polska

Ikonka przepływu

Building and maintaining
data flows

Ikonka środowisko

Building a Data Lake environment based on the Cloudera 6 technology stack

Ikonka zegar

Full project implementation
took 6 months

Contact

Let’s talk

We’re eagerly waiting for
a message from you!

Contact form

Formularz kontaktowy ENG

Detailed information on the processing of personal data is available in the Privacy Policy.