How to Generate Bert Embeddings Faster and More Reliably for large datasetAuthors: Ritesh Agrawal, Akshat Chandna, Serif KayaJul 25Jul 25
Published inEngineering @VaroEvolution of ML Platform @ VaroBuilding a holistic ML Platform has become more of an integration challenge. In this post we discuss the evolution of ML Platform at Varo.Oct 13, 2021Oct 13, 2021
Published inEngineering @VaroFeature Store: Challenges and ConsiderationsAuthors: Ritesh Agrawal, Brandon LeeMay 24, 2021May 24, 2021
Packaging Machine Learning Model The Right WayThis post shows how to leverage sklearn-pandas and baikal to encapsulate both transformations and scoring while generating serialized modelFeb 5, 2021Feb 5, 2021
Database Stored Procedure for Automated Changelogleveraging stored procedure to capture what’s updated in a tableJan 31, 2021Jan 31, 2021
Demystifying PyTorch: Understanding interaction between various PyTorch abstractions.PyTorch is one of the most used libraries for deep learning but is also one of the very difficult libraries to understand due to lot of…Dec 11, 2020Dec 11, 2020
Leveraging Annotation Library (PigeonXT) for Feature EngineeringOne of the first challenges in machine learning on structured data is “Feature Engineering.” It involves deciding whether to treat a…Jul 9, 2020Jul 9, 2020
AI²: Adaptive Infrastructure Using Artificial IntelligenceFrom self-driving cars to medical assistants, businesses have found a multitude of ways to leverage machine learning and artificial…Jun 1, 2020Jun 1, 2020
Sampling With Replacement In PrestoWhile analyzing an experiment data, I encountered an interesting brain teaser. I wanted to use the bootstrap method and for that needed to…Jun 12, 2019Jun 12, 2019
Data Sampling In PrestoThe importance of “Sampling” cannot be overstated. The conclusions we draw from the data as well as the quality of the machine-learned…Aug 11, 2017Aug 11, 2017