Quilt connects teams to actionable data by simplifying data discovery, sharing, and analysis. It’s designed to serve data-driven organizations with powerful tools for managing data as code, enabling rapid experimentation, and ensuring data integrity at scale.
Quilt consists of three main elements:
- Quilt Platform which is a cloud platform for interacting with, visualizing, searching and querying Quilt Packages, which is hosted in an organization's AWS Account.
- Quilt Python SDK which provides the ability to create, push, install and delete Quilt Packages.
- Quilt Ecosystem which provide extension of the core Quilt Capabilities to enable typical elements of life sciences workflows, such as incorporating orchestration data, and connecting packages to Electronic Lab Notebooks.
To dive deeper into the capabilities of Quilt, start with our Quick Start Guide or explore the Installation Instructions for setting up your environment.
If you have any questions or need help, join our Slack community or submit a support request to [email protected].
The Quilt documentation is structured to guide users through different layers of the platform, from basic concepts to advanced integrations. Whether you're a business user, developer, or platform administrator, the docs will help you quickly find the information you need.
The Quilt Platform powers the core features of the Quilt data catalog, providing tools for browsing, searching, and visualizing data stored in AWS S3. The platform is ideal for teams needing to collaborate on data, with capabilities like embeddable previews and metadata collection.
Core Sections:
- Architecture - Learn how Quilt is architected.
- Mental Model - Understand the guiding principles behind Quilt.
- Metadata Management - Manage metadata at scale.
For users of the Quilt Platform (often referred to as the Catalog):
- Bucket Browsing - Navigate through S3 buckets.
- Document Previews - Visualize documents and datasets directly in the web interface.
- Search - Leverage Quilt’s powerful search capabilities.
- Visualization & Dashboards - Create visual dashboards for data insights.
For administrators managing Quilt deployments:
- Admin Settings UI - Control platform settings and user access.
- Catalog Configuration - Set platform preferences.
- Cross-Account Access - Manage multi-account access to S3 data.
The Quilt Python SDK allows users to programmatically manage data packages, version datasets, and automate data workflows. Whether you're uploading a package, fetching data, or scripting custom workflows, the SDK provides the flexibility needed for deeper integrations.
- Installation - Get started with the Quilt SDK.
- Quick Start - Follow a step-by-step guide to building and managing data packages.
- Editing and Uploading Packages - Learn how to version, edit, and share data.
- API Reference - Detailed API documentation for developers.
The Quilt Ecosystem extends the platform with integrations and plugins to fit your workflow. Whether you're managing scientific data or automating packaging tasks, Quilt can be tailored to your needs with these tools:
- Benchling Packager - Package electronic lab notebooks from Benchling.
- Nextflow Plugin - Integrate with Nextflow pipelines for bioinformatics.
Quilt is for teams across industries like machine learning, biotech, and analytics who need to manage large datasets, collaborate seamlessly, and track the lifecycle of their data. Whether you're a data scientist, engineer, or administrator, Quilt helps streamline your data management workflows.
- Share: Easily share versioned data using simple URLs and email invites.
- Understand: Enrich data with inline documentation and visualizations for better insights.
- Discover: Use metadata and search tools to explore data relationships across projects.
- Model: Version and manage large data sets that don't fit traditional git repositories.
- Decide: Empower your team with auditable data for better decision-making.