personal-ai-assistant/plans/phase-4-documents-memory.md at master

dolgolyov.alexei 8b8fe916f0 Phase 4: Documents & Memory — upload, FTS, AI tools, context injection

Backend:
- Document + MemoryEntry models with Alembic migration (GIN FTS index)
- File upload endpoint with path traversal protection (sanitized filenames)
- Background document text extraction (PyMuPDF)
- Full-text search on extracted_text via PostgreSQL tsvector/tsquery
- Memory CRUD with enum-validated categories/importance, field allow-list
- AI tools: save_memory, search_documents, get_memory (Claude function calling)
- Tool execution loop in stream_ai_response (multi-turn tool use)
- Context assembly: injects critical memory + relevant doc excerpts
- File storage abstraction (local filesystem, S3-swappable)
- Secure file deletion (DB flush before disk delete)

Frontend:
- Document upload dialog (drag-and-drop + file picker)
- Document list with status badges, search, download (via authenticated blob)
- Document viewer with extracted text preview
- Memory list grouped by category with importance color coding
- Memory editor with category/importance dropdowns
- Documents + Memory pages with full CRUD
- Enabled sidebar navigation for both sections

Review fixes applied:
- Sanitized upload filenames (path traversal prevention)
- Download via axios blob (not bare <a href>, preserves auth)
- Route ordering: /search before /{id}/reindex
- Memory update allows is_active=False + field allow-list
- MemoryEditor form resets on mode switch
- Literal enum validation on category/importance schemas
- DB flush before file deletion for data integrity

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

Column	Type	Constraints
id	UUID	PK (inherited)
user_id	UUID	FK -> users.id CASCADE, NOT NULL, indexed
filename	VARCHAR(255)	NOT NULL (stored name)
original_filename	VARCHAR(255)	NOT NULL
storage_path	TEXT	NOT NULL
mime_type	VARCHAR(100)	NOT NULL
file_size	BIGINT	NOT NULL
doc_type	VARCHAR(50)	NOT NULL, default 'other'
extracted_text	TEXT	NULL
processing_status	VARCHAR(20)	NOT NULL, default 'pending'
metadata_	JSONB	NULL
created_at	TIMESTAMPTZ	inherited

Column	Type	Constraints
id	UUID	PK (inherited)
user_id	UUID	FK -> users.id CASCADE, NOT NULL, indexed
category	VARCHAR(50)	NOT NULL
title	VARCHAR(255)	NOT NULL
content	TEXT	NOT NULL
source_document_id	UUID	FK -> documents.id SET NULL, NULL
importance	VARCHAR(20)	NOT NULL, default 'medium'
is_active	BOOLEAN	NOT NULL, default true
created_at	TIMESTAMPTZ	inherited

4.9 KiB

Raw Permalink Blame History

Phase 4: Documents & Memory — Subplan

Goal

Prerequisites

Database Schema (Phase 4)

`documents` table

`memory_entries` table

Tasks

A. Backend Models & Migration (Tasks 1–4)

B. Backend Config & Utilities (Tasks 5–7)

C. Backend Schemas (Tasks 8–9)

D. Backend Services (Tasks 10–13)

E. Backend API Endpoints (Tasks 14–16)

F. Frontend API (Tasks 17–18)

G. Frontend Document Pages (Tasks 19–22)

H. Frontend Memory Pages (Tasks 23–25)

I. Routing, Sidebar, i18n (Tasks 26–28)

J. Backend Tests (Tasks 29–30)

Acceptance Criteria

Status

4.9 KiB Raw Permalink Blame History Unescape Escape

Phase 4: Documents & Memory — Subplan

Goal

Prerequisites

Database Schema (Phase 4)

documents table

memory_entries table

Tasks

A. Backend Models & Migration (Tasks 1–4)

B. Backend Config & Utilities (Tasks 5–7)

C. Backend Schemas (Tasks 8–9)

D. Backend Services (Tasks 10–13)

E. Backend API Endpoints (Tasks 14–16)

F. Frontend API (Tasks 17–18)

G. Frontend Document Pages (Tasks 19–22)

H. Frontend Memory Pages (Tasks 23–25)

I. Routing, Sidebar, i18n (Tasks 26–28)

J. Backend Tests (Tasks 29–30)

Acceptance Criteria

Status

4.9 KiB

Raw Permalink Blame History

`documents` table

`memory_entries` table