feat: add taxonomy classify service + /classify endpoint
- scripts/taxonomy.py: shared taxonomy with 14 categories, keyword scorer and classify_text() function - scripts/classify_server.py: FastAPI service — forwards to kreuzberg /extract, applies taxonomy, returns category/subcategory/confidence alongside full kreuzberg response - Dockerfile.classify: lightweight Python image for classify service - classify.nomad: Nomad job → classify.i80.dk - .gitea/workflows/classify.yml: CI/CD pipeline (build + deploy) - analyse_familie.py: refactored to import from taxonomy.py (no duplication) - .gitignore: exclude dokumenter_keywords.* and extract_all.log
This commit is contained in:
6
.gitignore
vendored
6
.gitignore
vendored
@@ -245,6 +245,12 @@ obj/
|
||||
pkg/
|
||||
|
||||
|
||||
# Doc classifier output files
|
||||
dokumenter_keywords.json
|
||||
dokumenter_keywords.classified.csv
|
||||
dokumenter_keywords.misplaced.csv
|
||||
extract_all.log
|
||||
|
||||
# Local dev artifacts
|
||||
docs/demo-dev.html
|
||||
docs/serve.json
|
||||
|
||||
Reference in New Issue
Block a user