
Lists (5)
Sort Name ascending (A-Z)
- All languages
- C
- C#
- C++
- CMake
- CSS
- Clojure
- Common Lisp
- Crystal
- Cuda
- Cython
- D
- Dockerfile
- Emacs Lisp
- Erlang
- Go
- HCL
- HTML
- Handlebars
- Haskell
- Java
- JavaScript
- Jsonnet
- Julia
- Jupyter Notebook
- Kotlin
- Lua
- MATLAB
- MDX
- Makefile
- Nix
- OCaml
- Objective-C
- OpenEdge ABL
- PLpgSQL
- Perl
- Python
- R
- Ruby
- Rust
- SCSS
- SQL
- Scala
- Shell
- Starlark
- Swift
- SystemVerilog
- TSQL
- TeX
- TypeScript
- Vim Script
- Visual Basic .NET
- Vue
- Zig
Starred repositories
Design patterns implemented in Java
Free and Open Source, Distributed, RESTful Search Engine
Guice (pronounced 'juice') is a lightweight dependency injection framework for Java 11 and above, brought to you by Google.
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
OpenRefine is a free, open source power tool for working with messy data and improving it
Apache Beam is a unified programming model for Batch and Streaming data processing.
Statistical Machine Intelligence & Learning Engine
Serve, optimize and scale PyTorch models in production
Example code from Learning Spark book
A distributed data integration framework that simplifies common aspects of big data integration such as data ingestion, replication, organization and lifecycle management for both streaming and bat…
Collect, aggregate, and visualize a data ecosystem's metadata
Apache Kafka, Apache Flink and Confluent Platform examples and demos
Java implementation of algorithms from Russell And Norvig's "Artificial Intelligence - A Modern Approach"
Dr. Elephant is a job and flow-level performance monitoring and tuning tool for Apache Hadoop and Apache Spark
An extensible distributed system for reliable nearline data streaming at scale
Method stubs and test cases for the problems from Elements of Programming Interviews
Osmosis is a command line Java application for processing OSM data.
Data Structures and Algorithms in Java (useful in interview process)
📚 Cracking the Coding Interview 6th edition problems
AWS libraries/modules for working with Kinesis aggregated record data
Simple JVM Profiler Using StatsD and Other Metrics Backends
Java client library for GeoServer
Supporting repository for the blog post at https://medium.com/@stephane.maarek/how-to-use-apache-kafka-to-transform-a-batch-pipeline-into-a-real-time-one-831b48a6ad85
SemanticVectors creates semantic WordSpace models from free natural language text.