Machine Learning Data Pipelines with Kafka and Tensorflow

Track: Architecture
An intense presentation designed to operationalize machine learning. This talk focuses on dividing specializations; data engineer and data scientist. The data engineer ensures that data is delivered, manipulated, and harnessed. The data engineer does this to be useful for the data scientist. The data engineer is also versant in Java and Scala and will be knowledgeable in pub-subs like Kafka. The data scientist uses that data, does their cleaning, and investigates possible patterns designing a machine learning model that we can use to either find regressions or classifications for our data. The data scientists use Python, Jupyter Notebooks, Tensorflow, and Matplotlib as their tools of choice for constructing a machine-learning model to make decisions about the data. This presentation answers the question. How do we take that model and tie it to everything else? This workshop will use a wide array of technologies. It will set you on the path to running Machine Learning Pipelines in Kubernetes using Kafka and Tensorflow, so you can start immediately when you return to work.
Daniel Hinojosa
Daniel Hinojosa is a programmer, consultant, instructor, speaker, and author. With over 20 years of experience, he does work for private, educational, and government institutions. Daniel loves JVM languages like Java, Groovy, and Scala; but also works with non-JVM languages like Haskell, Ruby, Python, LISP, C, C++. He is an avid Pomodoro Technique Practitioner and makes every attempt to learn a new programming language every year. Daniel is the author of Testing in Scala and the video of Beginning Scala Programming Video Series for O’Reilly Publishing. For downtime, he enjoys reading, swimming, Legos, football, and cooking.