Rucio helps you to manage your community's data

Built on a decade of operational experience, Rucio serves the data needs of modern scientific experiments.
Tremendous amounts of data, countless files, heterogeneous storage systems, globally distributed data centres, monitoring and analytics: all come together in a modular solution to fit your needs.

Extremely scalable

Need to search through billions of files? Need to transfer petabytes of data? Rucio has got you covered. Our largest installation is responsible for more than 350 petabytes of data, stored in a billion files and distributed over 120 data centres globally.

Policy-driven

Declarative data management allows you to say what you want and lets Rucio figure out the details of how to do it. Manage your data with expressive statements. Three copies of your file on different continents, plus one backup on tape? Automatic removal once its access popularity drops to zero? No problem.
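
To make the declarative idea concrete, here is a minimal sketch in Python. It is not Rucio's API: the `Rule` class, the `resolve` function, and the storage names are hypothetical and exist only for illustration. The point is that the rule states what is wanted (number of copies, medium, continent spread), and a separate resolver decides where the copies go.

```python
from dataclasses import dataclass
from typing import Optional

# Hypothetical storage catalogue: name -> (continent, medium).
# These names are made up for the example and are not real endpoints.
STORAGE = {
    "SITE_A_DISK": ("Europe", "disk"),
    "SITE_B_DISK": ("Europe", "disk"),
    "SITE_C_DISK": ("North America", "disk"),
    "SITE_D_DISK": ("Asia", "disk"),
    "SITE_E_TAPE": ("Europe", "tape"),
}


@dataclass(frozen=True)
class Rule:
    """A declarative statement: what is wanted, not how to achieve it."""
    copies: int
    medium: str
    distinct_continents: bool = False
    lifetime_seconds: Optional[int] = None  # None means keep indefinitely


def resolve(rule, storage):
    """Pick storage endpoints that satisfy the rule, or raise if impossible."""
    chosen = []
    used_continents = set()
    for name in sorted(storage):
        continent, medium = storage[name]
        if medium != rule.medium:
            continue
        if rule.distinct_continents and continent in used_continents:
            continue
        chosen.append(name)
        used_continents.add(continent)
        if len(chosen) == rule.copies:
            return chosen
    raise ValueError(f"cannot satisfy {rule}")


# "Three copies on different continents, and one backup on tape."
rules = [
    Rule(copies=3, medium="disk", distinct_continents=True),
    Rule(copies=1, medium="tape"),
]
placement = [resolve(rule, STORAGE) for rule in rules]
print(placement)
```

In this toy model the user never names a site. Changing the catalogue changes the placement, while the rules stay the same.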

Insights and analytics

Follow the evolution of your data over time, so you stay in control. From the popularity of your files to the storage space and tape accounting of your data centres. Fully integrated with Graphite, ElasticSearch, and Hadoop.

Features? We've got them.

  • Smart namespace

    Organise your files in datasets and containers, create virtual overlaps, distribute them by scope, or attach important metadata.

  • Storage support

    Rucio connects your existing storage, and you can easily add new and different systems. Even tapes, cloud-based storage, or supercomputers. We want you to have a choice, not to be locked in to a single solution.

  • Easy integration

    Existing applications and workflow systems can be integrated easily through our open libraries and REST servers. Rucio will not disrupt your experiment's workflows.

  • Authentication and authorisation

    Classic username/password, X.509 certificates with proxy support, GSS/Kerberos, and SSH public keys are all supported.

  • Monitoring

    Directly integrated with ElasticSearch and Graphite, so you will never lose track of your data. Follow system performance from a single file to the global overview.

  • Open source powered

    Robust code written in the Python language, unit-tested, PEP-certified. Deploy with pip or containers. It's free as in freedom (Apache v2) and open source!

  • Consistency

    Data loss happens every day, and Rucio is prepared. Smart consistency and recovery mechanisms help you avoid losing data to hardware failures.

  • Proven track record

    Originally built to meet the requirements of the high-energy physics experiment ATLAS, Rucio is scalable and robust. It also serves smaller communities, for example AMS and Xenon1t, and makes scientists' lives easier.
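
As an illustration of the smart namespace feature listed above, the following Python sketch models files grouped into datasets and datasets grouped into containers, with one file shared by two datasets (an overlap). It is a toy model, not Rucio code: the `attach` and `files_in` helpers and the `user.alice` scope are hypothetical, and identifiers are written as `scope:name`.

```python
# Parent identifier -> set of attached child identifiers.
children = {}


def attach(parent, child):
    """Attach a file or dataset to a parent dataset or container."""
    children.setdefault(parent, set()).add(child)


def files_in(did):
    """Resolve an identifier to the set of files beneath it."""
    if did not in children:
        return {did}  # nothing attached, so treat it as a file
    files = set()
    for child in children[did]:
        files |= files_in(child)
    return files


# Two datasets that overlap on file_b.
attach("user.alice:run1", "user.alice:file_a")
attach("user.alice:run1", "user.alice:file_b")
attach("user.alice:run2", "user.alice:file_b")
attach("user.alice:run2", "user.alice:file_c")

# A container grouping both datasets.
attach("user.alice:all_runs", "user.alice:run1")
attach("user.alice:all_runs", "user.alice:run2")

all_files = sorted(files_in("user.alice:all_runs"))
print(all_files)
```

Because membership is stored as references, the shared file appears once in the resolved container even though it belongs to two datasets.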

Get in touch

We are always happy to chat. Drop us an email and we will reply as quickly as possible.