repo_labor table taking up over half of the available database space

I noticed that the `repo_labor` table in our database is consuming 1.3TB, according to DBeaver.

Going by the disk usage reported by `sudo df -h`, the whole database is 2.3TB.


This table seems to be where the `scc` metrics get written. I can understand why it might be big if it tracks the lines of code and other metrics for every file, for every change, in every tracked repository.

But since we already have several tables representing files (`pull_request_files`, and eventually `commit_files`, see #3682), why can't we store this data with a foreign key referring to the existing file entries? That would deduplicate this table and reduce its space usage as much as possible.
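To make the idea concrete, here is a rough sketch of what I mean. It is not the real schema: the table names `tracked_file` and `file_metrics`, their columns, and the helper functions are all hypothetical, and it uses an in-memory SQLite database just so it runs anywhere. The point is only that the repo and file path are stored once, and each metrics row carries a small integer key instead of repeating them.

```python
import sqlite3

# Hypothetical schema for illustration only; these are NOT the actual
# repo_labor / pull_request_files definitions.
SCHEMA = """
CREATE TABLE tracked_file (
    file_id   INTEGER PRIMARY KEY,
    repo_id   INTEGER NOT NULL,
    file_path TEXT    NOT NULL,
    UNIQUE (repo_id, file_path)
);
CREATE TABLE file_metrics (
    file_id     INTEGER NOT NULL REFERENCES tracked_file(file_id),
    commit_hash TEXT    NOT NULL,
    total_lines INTEGER NOT NULL,
    code_lines  INTEGER NOT NULL,
    PRIMARY KEY (file_id, commit_hash)
);
"""


def build_db():
    """Create an in-memory database with the sketch schema."""
    conn = sqlite3.connect(":memory:")
    conn.execute("PRAGMA foreign_keys = ON")  # SQLite leaves FK checks off by default
    conn.executescript(SCHEMA)
    return conn


def record_metrics(conn, repo_id, file_path, commit_hash, total_lines, code_lines):
    """Store one metrics row, reusing the existing file entry if there is one."""
    # The UNIQUE constraint makes this a no-op when the file is already known.
    conn.execute(
        "INSERT OR IGNORE INTO tracked_file (repo_id, file_path) VALUES (?, ?)",
        (repo_id, file_path),
    )
    (file_id,) = conn.execute(
        "SELECT file_id FROM tracked_file WHERE repo_id = ? AND file_path = ?",
        (repo_id, file_path),
    ).fetchone()
    # Only the integer key is repeated per change, not the path text.
    conn.execute(
        "INSERT OR REPLACE INTO file_metrics VALUES (?, ?, ?, ?)",
        (file_id, commit_hash, total_lines, code_lines),
    )
    return file_id


if __name__ == "__main__":
    conn = build_db()
    for i, commit in enumerate(["a1", "b2", "c3"]):
        record_metrics(conn, 1, "src/main.py", commit, 100 + i, 80 + i)
    print(conn.execute("SELECT COUNT(*) FROM tracked_file").fetchone())
    print(conn.execute("SELECT COUNT(*) FROM file_metrics").fetchone())
```

How much space this would actually save depends on what `repo_labor` stores per row today, which I have not measured.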


repo_labor table taking up over half of the available database space #3736
