Getting Started

Introduction

An overview of Envision Portal and its core capabilities.

Envision Portal is a web platform for managing, standardizing, and sharing biomedical research datasets. It is designed for research teams working with eye imaging data and other clinical datasets who need a structured way to document, version, and publish their work.

The platform is developed within the Eye ACT project to address common barriers in ophthalmic data reuse: fragmented sharing practices, missing metadata, and inconsistent file formats. Envision Portal focuses on making datasets more Findable, Accessible, Interoperable, and Reusable (FAIR) and better prepared for downstream AI workflows.

What Envision Portal Does

The platform covers the full lifecycle of a research dataset:

  • Create datasets with rich metadata following established standards (Dublin Core, DataCite, FAIR principles)
  • Document datasets with clinical trial study information and data quality documentation (Healthsheet)
  • Upload files to cloud storage (Azure Data Lake) with separate environments for drafts and published data
  • Publish datasets through a guided multi-step review workflow that generates a README and changelog
  • Control access with role-based team permissions, public access requests, and controlled access workflows with data use agreements
  • Discover publicly available datasets and register external datasets from other repositories

The long-term scope also includes automated indexing of eye imaging datasets from external repositories and advanced natural-language discovery features.

Who It Is For

Envision Portal is intended for:

  • Researchers who create and manage datasets and need to document and share them
  • Data stewards who review access requests and manage team permissions
  • Collaborators who need read or edit access to a dataset
  • External users who want to discover and request access to published datasets

Key Concepts

Datasets

A dataset is the central object in the platform. Each dataset has a canonical ID that persists across versions, a current version number, and a status of either draft or published.

Metadata

Each dataset supports three layers of metadata:

LayerPurpose
Dataset MetadataGeneral information: titles, descriptions, contributors, identifiers, access rights, licensing
Study MetadataClinical trial information: design, arms, interventions, eligibility, locations, contacts
HealthsheetData quality documentation: motivation, composition, collection, preprocessing, distribution, uses, maintenance

Access Levels

Datasets can be accessed at different levels:

  • Public access allows anyone to request and download the dataset
  • Controlled access requires a formal request, a signed data use agreement, and approval from the dataset owner

The product roadmap also tracks support for additional access patterns (for example, restricted or policy-based access tiers) as governance requirements evolve.

Team Roles

Every dataset has a team of members with one of four roles: owner, admin, editor, or viewer. Roles determine what actions each member can take on the dataset.

Supported Data Standards

Envision Portal guidance and validation workflows are aligned with standards used for interoperability and reuse:

  • CDS (Clinical Dataset Structure) for consistent multi-modal dataset organization
  • DICOM for eye imaging files wherever possible
  • OMOP CDM-aligned tables for associated clinical and observational data
  • FAIR metadata practices to improve discoverability and reuse

These standards are expected to evolve with the community, and documentation will be updated accordingly.

Development Principles

The platform is developed with repository trust and sustainability in mind, including:

  • Transparent public documentation and open-source development
  • Persistent identifiers (DOIs) and reproducible versioning
  • User-focused workflows for both contributors and data consumers
  • Secure, maintainable cloud infrastructure and policy-aware access controls

Roadmap Snapshot

The Envision Portal roadmap spans 2024-2029, with phased milestones:

  • Initial architecture and standards definition
  • Early launch with hosted and externally indexed datasets
  • Contributor-facing automation for standardization and PHI checks
  • Advanced discovery, APIs, and controlled-access workflows
  • Sustainability, security hardening, and community-led growth
Copyright © 2026