Demos of Banyan Julia/Python
What is Banyan Julia/Python?
Banyan Julia/Python is a suite of software libraries for large-scale data processing with automatic instant data sampling. The suite includes:
- Banyan.jl for custom scripting
- BanyanArrays.jl for processing large n-dimensional arrays
- BanyanImages.jl for scalable image processing
- BanyanHDF5.jl for big HDF5 data
- BanyanDataFrames.jl for high-performance tabular data analytics
- BanyanONNXRunTime.jl for large-scale machine learning inference
- banyan-python for custom scripting
Self-Service Demos
Self-service demos of Banyan Julia/Python are available at github.com/banyan-team/banyan-julia-examples (Julia) and github.com/banyan-team/banyan-python-examples (Python).
- Banyan Julia
  - iris.ipynb
    - This notebook is a good starting place if you are new to Banyan. It provides a simple example of how to spin up a cluster and a job and how to run some data analytics on a CSV file on the cluster.
  - nyc_taxi.ipynb
    - Large-scale data analysis of CSV dataset stored in the cloud (S3)
  - nyc_taxi_sampling.ipynb
    - Sampling large dataset stored in S3
  - parameter_tuning.ipynb
    - Massively parallel parameter tuning in the cloud
  - shallow_water_modeling.ipynb
    - MPI-powered fluid dynamics simulation
  - satellite_image_encoding
    - PyTorch-based encoding of thousands of NASA satellite images (Unreleased)
- Banyan Python
  - flow_simulations
    - Flow simulation with Banyan Python Custom Scripting
To try Banyan Julia/Python for yourself, follow the getting-started steps to set up your Banyan account, then run the example notebooks.