awebsite/TAG_CONSOLIDATION_SUMMARY.md

173 lines
4.5 KiB
Markdown
Raw Normal View History

2025-10-06 18:01:45 +02:00
# Tag Consolidation Summary
## Results
- **Before**: 353 unique tags
- **After**: 257 unique tags
- **Reduction**: 96 tags (27% reduction)
- **Files changed**: 49 files
- **Individual changes**: 148 tag modifications
## Key Consolidations Applied
### 1. Capitalization Standardization
All tags now use lowercase:
- `AI``ai`
- `Unity``unity`
- `VLF``vlf`
- `3D printing``3d printing`
- `Arduino``arduino`
- `Linux``linux`
### 2. Language Standardization (English Only)
Removed German tags from all files:
- `programmierung``programming`
- `mobile werkstatt``mobile workshop`
- `urbane intervention``urban intervention`
- `bildung``education`
### 3. Concept Consolidation
**3D Printing/Fabrication** (consolidated to `3d printing`):
- additive manufacturing
- filastruder
- filament
- design for printing
**Energy** (consolidated to `energy`):
- electricity
- solar
- grid
**Sustainability** (consolidated to `sustainability`):
- cradle-to-cradle
- circular
- environment (when context appropriate)
**Recycling** (consolidated to `recycling`):
- waste
- precious plastic
- shredder
- plastics-as-waste
- collaborative recycling
**Data** (consolidated appropriately):
- `data collection``data`
- `data viz``data visualization`
**Interactive** (consolidated to `interactive`):
- game
- 1st person
- 2 player
- 3rd person
**Infrastructure**:
- hosting → `infrastructure`
- decentral → `decentralized`
**Research/Academia**:
- university → `research`
- master thesis → `thesis`
**Programming/ML**:
- rust → `programming`
- physics → `programming` (when used in context)
- ml → `machine learning`
**Media Theory** (person names consolidated):
- geert lovink → `media theory`
- evgeny morozov → `media theory`
- lisa parks → `media theory`
- francis hunger → `media theory`
**Philosophy** (person names consolidated):
- alison jaggar → `philosophy`
- elizabeth anderson → `philosophy`
- elsa dorlin → `philosophy`
- francois ewald → `philosophy`
- josé medina → `philosophy`
- judith butler → `philosophy`
- michael foucault → `philosophy`
- miranda fricker → `philosophy`
**Other Consolidations**:
- automatic → `automation`
- automatic1111 → `stable diffusion`
- blogging → `communication`
- cyberpunk → `speculative design`
- scaling → `design`
- privat → `work`
### 4. Tags Removed
- `TODO, unfinished` - removed from 7 files (not a meaningful public tag)
## Top 30 Most Used Tags (After Consolidation)
1. university of the arts berlin (26)
2. 3d printing (23)
3. work (17)
4. private (17)
5. experiment (16)
6. university of osnabrück (14)
7. studio d+c (14)
8. democratic (14)
9. recycling (12)
10. engineering (12)
11. python (10)
12. interactive (10)
13. energy (10)
14. sustainability (8)
15. media theory (8)
16. making (8)
17. education (8)
18. einszwovier (7)
19. urban intervention (6)
20. unity (6)
21. thesis (6)
22. research (6)
23. radio (6)
24. programming (6)
25. plastics (6)
26. india (6)
27. decentralized (6)
28. technische universität berlin (4)
29. suv (4)
30. stable diffusion (4)
## Recommendations for Further Reduction
To reach closer to 50 tags, consider:
1. **Institution tags**: Keep only main institutions, remove specific departments
- Keep: `university of the arts berlin`, `university of osnabrück`
- Consider generalizing project-specific ones
2. **Tool-specific tags**: Consolidate specific tools into broader categories
- `google dialogflow`, `google cloud`, `google assistant``voice assistant` or `ai`
- `tensorflow`, `keras``machine learning`
3. **Material tags**: Group related materials
- `plastics`, `clay`, `iron``materials`
4. **Specific technologies**: Keep only if frequently used
- Consider removing one-off tools/libraries
5. **Geographic specificity**:
- `india`, `himalaya`, `iit kharagpur` → Consider just `india`
## Tag Guidelines Going Forward
1. **Always lowercase** - no exceptions
2. **English only** - never translate tags
3. **Use existing tags** - check the list before creating new ones
4. **Prefer general over specific** - unless the specific tag will be used 3+ times
5. **Remove person names** - use topic tags instead
6. **Avoid TODO/meta tags** - those belong in draft status, not tags
## Scripts Created
- `scripts/analyze_tags.sh` - Analyze and count tag usage
- `scripts/consolidate_tags.py` - Main consolidation script
- `scripts/final_tag_cleanup.py` - Final cleanup pass
- `scripts/tag_consolidation_map.txt` - Reference mapping document