Samhita Alla

812 Followers

Jul 4

Fine-Tuning Insights: Lessons from Experimenting with RedPajama Large Language Model on Flyte Slack Data

Large language models (LLMs) have taken the world by storm, revolutionizing our understanding and generation of human-like text. These models have showcased remarkable capabilities across a range of tasks, including question-answering, chatbots and even creative writing. Naturally, like many others, I was filled with excitement to explore and experience the…

Large Language Models

12 min read

Mar 23

Analyzing COVID-19 Impact on NYC Taxis with Flyte and DuckDB

The world of data analytics is constantly evolving, with new tools and technologies emerging every day. One popular new tool is DuckDB, open-source software touted as a faster, more efficient alternative to traditional databases. So, what exactly is DuckDB, and why is it gaining so much attention in the data…

Analytics

7 min read

Mar 13

10-Minute Guide to Accelerate Your ML Pipeline from Data to Deployment

Learn to build a production-grade ML pipeline with Flyte, Banana, and Hugging Face. Before you train an ML model, you need to procure, prepare and analyze the data you’ll use to build and train it. When additional data arrives, you may need to repeat these steps multiple times, a process known as “retraining.” To ensure reproducibility, it may be necessary to version…

Machine Learning

9 min read

Published in Better Programming · Jul 21, 2022

Is Airflow the Right Choice for Machine Learning Too?

A look at the differences between ETL and machine learning tasks — Apache Airflow is an open source platform that can be used to author, monitor, and schedule data pipelines. It is used by companies like Airbnb, Lyft, and Twitter and has been the go-to tool in the data engineering ecosystem. With an increased necessity for orchestration of data pipelines, Airflow witnessed…

Programming

5 min read

Published in Union-ai · May 26, 2022

MLOps with Flyte: The Convergence of Workflows Between Machine Learning and Engineering

So your company is building jaw-dropping machine learning (ML) models that are performant and outputting the best results. The next task is to promote the models to production. It seems easy. After all, the most complex phase — building ML models — yielded a fruitful result. And shouldn’t it be…

Mlops

11 min read

Published in Towards Data Science · May 11, 2022

Build an Event-Driven Neural Style Transfer Application Using AWS Lambda

To build a production-ready ML application and ensure its stability in the long run, we need to take care of a long checklist of requirements which include the ease with which the models could be iterated, reproducibility, infrastructure, automation, resources, memory, and so on. On top of that, we need…

Deep Learning

8 min read

Published in Better Programming · Feb 11, 2022

5 Open-Source Tools That Can Help You Build ML Pipelines With Ease

All production-friendly. ML isn’t all about training a K-means model on Iris data in a Jupyter notebook. You might want to train a complex ML model on large volumes of data using high-end infrastructure. Now, this cannot be done with a single tool. We need a collection of them! With…

Programming

5 min read

Published in Union-ai · Dec 8, 2021

Data-Parallel Distributed Training With Horovod and Flyte

Understand how Horovod and Flyte, along with Spark and MPIOperator, simplify building robust distributed data pipelines. This post is based on a talk titled “Efficient Data Parallel Distributed Training with Flyte, Spark & Horovod”, presented by Katrina Rogan and Ketan Umare at OSPOCon 2021, Seattle. To get the ball rolling, let’s understand what the title implies.

Distributed Training

6 min read

Published in Union-ai · Nov 11, 2021

Bring ML Close to Data Using Feast and Flyte

And handle feature-engineered data effectively in an ML pipeline. This post is based on a talk titled “Self-serve Feature Engineering Platform Using Flyte and Feast,” which was presented by Ketan Umare and Felix Wang at OSPOCon 2021, Seattle. Feature engineering is one of the greatest challenges in applied machine learning. It is the process of transforming raw data…

Programming

4 min read

Published in Union-ai · Sep 30, 2021

Meet Flyte at Hacktoberfest 2021

Open source is growing fast, and so is its community of contributors. At Flyte, we have had meaningful contributions from many people who have helped us improve Flyte’s codebase and made us consider perspectives we had not seen before. We have always wanted to have more such contributions to expand our horizons. We have been thinking about…

Open Source

4 min read


Software Engineer and Developer Advocate @Flyte
