PlaneCycle

PlaneCycle: Training-Free 2D-to-3D Lifting of Foundation Models Without Adapters (arXiv)

Overview

PCA visualizations of frozen lifted DINOv3 features.
Evaluated on three 3D datasets across HW, DW, and DH planes (inconsistencies circled).

Overview of PlaneCycle across three orthogonal planes (HW, DW, DH).
Flattened slice tokens are processed by shared ViT layers with plane-specific RoPE.

Pretrained Models

We utilize DINOv3 as the backbone for our 2D-to-3D lifting. DINOv3 Weights: Please follow the official repository facebookresearch/dinov3 to download the pretrained checkpoints.

Requirements

Our code is built on top of the DINOv3 framework.

DINOv3 Environment: Follow the installation guide to set up the basic dependencies.
Additional Packages: medmnist, transformers

PlaneCycle Converter

Configure the following parameters in the converter.

Key Parameters

cycle_order Specifies the sequence of spatial planes for feature aggregation. Each block can be assigned a plane from `('HW', 'DW', 'DH').
- Default (Paper): ('HW', 'DW', 'DH', 'HW')
- Alternatives: ('HW', 'DW', 'DH') or any user-defined sequence.
pool_method
Specifies how global tokens are aggregated across planes.
- "PCg" (default): Uses adaptive pooling to preserve spatial token structure.
- "PCm": Uses mean pooling to obtain a global volumetric representation.

Example

import torch
from planecycle.converters.dinov3_converter import PlaneCycleConverter

REPO_DIR = <PATH/TO/DINOV3/REPOSITORY>

x = torch.randn(2, 3, 64, 256, 256) # （N, 3, D, H, W）

# Load a DINOv3 ViT backbone pretrained on web images
model = torch.hub.load(
    REPO_DIR,
    "dinov3_vits16",
    source="local",
    weights=<CHECKPOINT/URL/OR/PATH>
)

# Convert the 2D backbone into a 3D PlaneCycle model
converter = PlaneCycleConverter(
    cycle_order=('HW', 'DW', 'DH', 'HW'),
    pool_method="PCg"
)
model = converter(model)

out = model(x)

Code Structure

planecycle/
Core implementation of the PlaneCycle framework.
- operators/ – Implementation of the PlaneCycle operators.
- converters/ – Converters for adapting pretrained ViT backbones.
models/
- vision_transformer/ – Modified Vision Transformer implementation used in this project.
experiments/
Scripts for running experiments and reproducing results.
- medmnist/ – Training and benchmarking pipelines for six 3D MedMNIST+ datasets.

How to run the experiments

six 3D classification datasets(./experiments/medmnist/train_eval.py)

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
assets		assets
experiments/medmnist		experiments/medmnist
models		models
planecycle		planecycle
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

PlaneCycle

Overview

Pretrained Models

Requirements

PlaneCycle Converter

Key Parameters

Example

Code Structure

How to run the experiments

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

PlaneCycle

Overview

Pretrained Models

Requirements

PlaneCycle Converter

Key Parameters

Example

Code Structure

How to run the experiments

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages