1 - 25 of 27 available results.
flink logo
flink

50M+

436
Updated 6 days ago
Apache Flink® is a powerful open-source distributed stream and batch processing framework.
Data Science
storm logo
storm

5M+

201
Updated 20 days ago
Apache Storm is a free and open source distributed realtime computation system.
Data Science
geonetwork logo
geonetwork

5M+

88
Updated 6 days ago
GeoNetwork is a FOSS catalog for spatially referenced resources.
Data Science
couchdb logo
apache/couchdb

50M+

26

By The Apache Software Foundation

Updated 4 months ago
Unofficial convenience binaries for CouchDB, the RESTful document-oriented database
Data Science
Databases & Storage
arrow-dev logo
apache/arrow-dev

5M+

3

By The Apache Software Foundation

Updated a few seconds ago
Apache Arrow convenience images for development use
Data Science
Languages & Frameworks
Integration & Delivery
surprise logo
docker/surprise

10K+

4

By Docker, Inc.

Updated 6 years ago
A Whale of a Sixth Birthday Surprise
Data Science
Languages & Frameworks
Security
pdftk logo
pdftk/pdftk

10K+

11

By pdftk

Updated a day ago
GCJ-free toolkit for manipulating PDF documents
Content Management System
Data Science
Security
chipseq logo
sibswiss/chipseq

217

1
The ChIP-Seq tools: a resource for analyzing ChIP-seq and other types of genomic data
Data Science
Machine Learning & AI
ucnebase logo
sibswiss/ucnebase

127

0
A database of ultra-conserved non-coding elements and genomic regulatory blocks
Data Science
Machine Learning & AI
nicolasverlhiac/geocode-csv

100K+

82

By nicolasverlhiac

Updated 10 months ago
Geocode (Latitude and longitude) CSV files containing contact addresses using Geocoding Providers.
Data Science
Developer Tools
nessie logo
projectnessie/nessie

100K+

0

By projectnessie

Updated a year ago
A Git-Like Experience for your Data Lake. The repository has moved. Check overview for details
Data Science
Databases & Storage
xjxjin/alist-sync

50K+

1

By xjxjin

Updated 4 days ago
Alist-Sync - A tool for syncing files between Alist storages
Data Science
Databases & Storage
Web Analytics
flink logo
s390x/flink

10K+

0

By s390x

Updated 6 years ago
Apache Flink® is a powerful open-source distributed stream and batch processing framework.
Data Science
geonetwork logo
s390x/geonetwork

10K+

0

By s390x

Updated 6 years ago
GeoNetwork is a FOSS catalog for spatially referenced resources.
Data Science
jauderho/miller

9.9K

0

By jauderho

Updated 4 days ago
Miller is like awk, sed, cut, join, and sort for data formats such as CSV, TSV, tabular JSON etc.
Data Science
Developer Tools
jauderho/octosql

6.9K

0

By jauderho

Updated 15 hours ago
OctoSQL is a query tool that joins, analyses and transforms data from multiple databases
Data Science
Databases & Storage
Developer Tools
jauderho/rclone

6.6K

0

By jauderho

Updated 16 hours ago
rclone is a cli app to sync files and dirs between cloud providers
Data Science
Databases & Storage
Developer Tools
storm logo
s390x/storm

6.6K

0

By s390x

Updated 6 years ago
Apache Storm is a free and open source distributed realtime computation system.
Data Science
tobi312/tools

5.6K

0

By tobi312

Updated 4 hours ago
Tools collection - Docker Images for amd64, arm64, arm (Raspberry Pi)
Data Science
Internet of Things
Message Queues
jauderho/fq

5.5K

0

By jauderho

Updated 16 hours ago
fq is a tool for inspecting binary data
Data Science
Developer Tools
jauderho/visidata

5.3K

1

By jauderho

Updated 15 hours ago
Visidata is a terminal interface for exploring and arranging tabular data
Data Science
Databases & Storage
Developer Tools
jauderho/dsq

5.2K

0

By jauderho

Updated 15 hours ago
dsq is a CLI companion to DataStation (a GUI) for running SQL queries against data files
Data Science
Databases & Storage
Developer Tools
nessie-unstable logo
projectnessie/nessie-unstable

3.9K

0

By projectnessie

Updated a year ago
Nightly version of Nessie. The repository has moved. Check overview for details
Data Science
Databases & Storage
jauderho/gron

2.4K

0

By jauderho

Updated 17 hours ago
gron transforms JSON into manageable chunks
Data Science
Developer Tools
jauderho/textql

2.2K

0

By jauderho

Updated 16 hours ago
textql allows you to easily execute SQL against structured text like CSV or TSV
Data Science
Databases & Storage
Developer Tools