bhavika's starred repositories

34 stars written in Scala

The leader in Customer Data Infrastructure

Scala 6,958 1,192 Updated Jun 4, 2025

Deploy and manage containers (including Docker) on top of Apache Mesos at scale.

Scala 4,054 836 Updated Sep 8, 2022

A Scala API for Cascading

Scala 3,524 706 Updated May 28, 2023

In-memory dimensional time series database.

Scala 3,507 322 Updated Sep 29, 2025

Breeze is/was a numerical processing library for Scala.

Scala 3,456 693 Updated Aug 29, 2024

The easy way to learn Scala.

Scala 2,638 545 Updated Sep 23, 2025

A Scala API for Apache Beam and Google Cloud Dataflow.

Scala 2,612 526 Updated Sep 29, 2025

TransmogrifAI (pronounced trăns-mŏgˈrə-fī) is an AutoML library for building modular, reusable, strongly typed machine learning workflows on Apache Spark with minimal hand-tuning

Scala 2,271 401 Updated Sep 29, 2023

A low code Machine Learning personalized ranking service for articles, listings, search results, recommendations that boosts user engagement. A friendly Learn-to-Rank engine

Scala 2,157 97 Updated Sep 24, 2025

GeoMesa is a suite of tools for working with big geo-spatial data in a distributed fashion.

Scala 1,462 439 Updated Sep 29, 2025

GeoTrellis is a geographic data processing engine for high performance applications.

Scala 1,368 362 Updated Sep 22, 2025

Spark RAPIDS plugin - accelerate Apache Spark with GPUs

Scala 933 259 Updated Sep 29, 2025

Essential Spark extensions and helper methods ✨😲

Scala 764 152 Updated Sep 14, 2025

Spark reference applications

Scala 656 339 Updated Oct 3, 2024

Qubole Sparklens tool for performance tuning Apache Spark

Scala 584 143 Updated Jun 26, 2024

Examples for High Performance Spark

Scala 519 238 Updated Sep 2, 2025

Apache Spark testing helpers (dependency free & works with Scalatest, uTest, and MUnit)

Scala 451 77 Updated Aug 8, 2025

Apache Spark training material

Scala 402 357 Updated Nov 24, 2015

Code snippets for solving common big data problems in various platforms. Inspired by Rosetta Code

Scala 295 34 Updated Jan 31, 2025

Benchmark Suite for Apache Spark

Scala 241 124 Updated Apr 12, 2023

Solution to Facebook's link prediction contest on Kaggle.

Scala 206 67 Updated Jul 31, 2012

Visualize statistics from the MOOC "Functional Programming Principles in Scala" using Scala!

Scala 202 57 Updated Mar 31, 2014

Spark Structured Streaming / Kafka / Cassandra / Elastic

Scala 183 75 Updated Feb 7, 2023

Quick summary: This code implements a spectral (third-order tensor decomposition) method for learning an LDA topic model on Spark.

Scala 105 23 Updated Jul 2, 2018

Topic Modeling on Apache Spark

Scala 94 33 Updated Mar 1, 2019

Performance optimization for Spark running on Kubernetes

Scala 90 29 Updated Aug 18, 2020

OSMesa is an OpenStreetMap processing stack based on GeoTrellis and Apache Spark

Scala 80 26 Updated Mar 15, 2022

Randomized SVD of large sparse matrices on Spark

Scala 77 22 Updated Jul 21, 2022

Spark 2.0 Scala Machine Learning examples

Scala 77 51 Updated Oct 4, 2019

Lighthouse is a library for data lakes built on top of Apache Spark. It provides high-level APIs in Scala to streamline data pipelines and apply best practices.

Scala 62 10 Updated Sep 6, 2024