ML and Spark MLlib smoothie
When talking about BigData, there is a need in both data scientists that can tune models' parameters from R/Python packages and Java developers able to understand the built models after implementing them to Java/Scala, including Spark MLlib. Let's get acquainted with this most powerful distributed ML library, along with discussing the special aspects of using standard machine learning algorithms and Spark data structures.
Just as Charon from the Greek myths, Alexey helps people to get from one side to the other, the sides being Java and Big Data in his case. Or, in more simple words, he is a trainer at EPAM Systems. He works with Hadoop/Spark and other Big Data projects since 2012, forks such projects and sends pull requests since 2014, presents talks since 2015. His favourite areas are text data and big graphs.