apache / spark
Apache Spark - A unified analytics engine for large-scale data processing
See what the GitHub community is most excited about today.
♞ lichess.org: the forever free, adless and open source chess server ♞
The Community Maintained High Velocity Web Framework For Java and Scala.
sbt, the interactive build tool
Build highly concurrent, distributed, and resilient message-driven applications on the JVM
Open-source high-performance RISC-V processor
Chisel: A Modern Hardware Design Language
Removes large or troublesome blobs like git-filter-branch does, but faster, and written in Scala
Prevents leaking sensitive fields defined inside `case class`
An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs
An open protocol for secure data sharing
FEEL parser and interpreter written in Scala
The Daml smart contract language
Modern Load Testing as Code
Scala 2 compiler and standard library. Bugs at https://github.com/scala/bug; Scala 3 at https://github.com/scala/scala3
Sharding and location transparency for Scala
XML data source for Spark SQL and DataFrames
Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.
Apache Spark Connector for SQL Server and Azure SQL
Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.
A Spark plugin for reading and writing Excel files
A Git platform powered by Scala with easy installation, high extensibility & GitHub API compatibility
Apache Livy is an open source REST interface for interacting with Apache Spark from anywhere.
The batteries-included testing and formal verification library for Chisel-based RTL designs.
Code formatter for Scala