Database replication in Hadoop
Nowadays you can find plenty of information about working with data in Hadoop, from traditional SQL engines to machine learning and stream processing. Suppose we have successfully prototyped a solution and shown our customer all the advantages of processing data on Hadoop. Now it's time for simple infrastructure tasks to come to the fore: loading data into Hadoop on a regular basis, ensuring access control, deciding what to do in case of failure, and so on.
In this talk, we'll focus on one task — loading data into Hadoop for later analysis.
Has worked in the Big Data area at Sberbank Technology since the very first day it was established; built from scratch the department that develops the Sberbank Big Data platform. Takes an active part in designing the solution architecture. Besides being a developer, is also part of the Data Science area, where he leads initiatives based on Big Data.