more translation

parent 2ce9ca50b5
commit a41be821c1
997 changed files with 33247 additions and 32490 deletions

TAG_CONSOLIDATION_SUMMARY.md (new file, 172 lines)

@@ -0,0 +1,172 @@
# Tag Consolidation Summary

## Results

- **Before**: 353 unique tags
- **After**: 257 unique tags
- **Reduction**: 96 tags (27% reduction)
- **Files changed**: 49 files
- **Individual changes**: 148 tag modifications

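The reduction figures follow directly from the before and after counts. A minimal worked check, using only the numbers reported in the Results list:

```python
# Figures taken from the Results list above.
before, after = 353, 257

reduction = before - after          # tags eliminated by consolidation
percent = 100 * reduction / before  # share of the original tag set

print(f"{reduction} tags removed ({percent:.1f}% of {before})")
# 96 tags removed (27.2% of 353)
```
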
## Key Consolidations Applied

### 1. Capitalization Standardization

All tags now use lowercase:
- `AI` → `ai`
- `Unity` → `unity`
- `VLF` → `vlf`
- `3D printing` → `3d printing`
- `Arduino` → `arduino`
- `Linux` → `linux`

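The lowercase rule can be sketched as a small normalization step. This is an illustration only, not the code in `scripts/consolidate_tags.py`: the function names are hypothetical, and collapsing extra whitespace is an added assumption that the summary does not mention.

```python
from __future__ import annotations


def normalize_tag(tag: str) -> str:
    """Lowercase a tag and collapse whitespace (the latter is an assumption)."""
    return " ".join(tag.split()).lower()


def normalize_tags(tags: list[str]) -> list[str]:
    """Normalize each tag, dropping duplicates that differed only by case."""
    seen: set[str] = set()
    result: list[str] = []
    for tag in tags:
        norm = normalize_tag(tag)
        if norm and norm not in seen:
            seen.add(norm)
            result.append(norm)
    return result


print(normalize_tags(["AI", "Unity", "3D printing", "ai", "Linux"]))
# ['ai', 'unity', '3d printing', 'linux']
```

Deduplicating after lowercasing matters because a file tagged with both `AI` and `ai` would otherwise end up with the same tag twice.
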
### 2. Language Standardization (English Only)

Replaced German tags with their English equivalents in all files:
- `programmierung` → `programming`
- `mobile werkstatt` → `mobile workshop`
- `urbane intervention` → `urban intervention`
- `bildung` → `education`

### 3. Concept Consolidation

**3D Printing/Fabrication** (consolidated to `3d printing`):
- `additive manufacturing`
- `filastruder`
- `filament`
- `design for printing`

**Energy** (consolidated to `energy`):
- `electricity`
- `solar`
- `grid`

**Sustainability** (consolidated to `sustainability`):
- `cradle-to-cradle`
- `circular`
- `environment` (when context appropriate)

**Recycling** (consolidated to `recycling`):
- `waste`
- `precious plastic`
- `shredder`
- `plastics-as-waste`
- `collaborative recycling`

**Data**:
- `data collection` → `data`
- `data viz` → `data visualization`

**Interactive** (consolidated to `interactive`):
- `game`
- `1st person`
- `2 player`
- `3rd person`

**Infrastructure**:
- `hosting` → `infrastructure`
- `decentral` → `decentralized`

**Research/Academia**:
- `university` → `research`
- `master thesis` → `thesis`

**Programming/ML**:
- `rust` → `programming`
- `physics` → `programming` (when the context is programming)
- `ml` → `machine learning`

**Media Theory** (person names consolidated):
- `geert lovink` → `media theory`
- `evgeny morozov` → `media theory`
- `lisa parks` → `media theory`
- `francis hunger` → `media theory`

**Philosophy** (person names consolidated):
- `alison jaggar` → `philosophy`
- `elizabeth anderson` → `philosophy`
- `elsa dorlin` → `philosophy`
- `francois ewald` → `philosophy`
- `josé medina` → `philosophy`
- `judith butler` → `philosophy`
- `michael foucault` → `philosophy`
- `miranda fricker` → `philosophy`

**Other Consolidations**:
- `automatic` → `automation`
- `automatic1111` → `stable diffusion`
- `blogging` → `communication`
- `cyberpunk` → `speculative design`
- `scaling` → `design`
- `privat` → `work`

### 4. Tags Removed
- `TODO, unfinished` - removed from 7 files (not a meaningful public tag)

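The translations, consolidations and removals above all amount to a lookup from an old tag to a new tag, or to nothing. The sketch below shows one way such a mapping could be applied. It is not the actual `scripts/consolidate_tags.py`: `TAG_MAP` and `consolidate` are hypothetical names, the map holds only a few entries copied from the lists above, and treating `TODO, unfinished` as two separate tags is an assumption.

```python
from typing import Dict, List, Optional

# Excerpt of the mapping, copied from the lists above.
# None marks a tag that is removed outright (assumed: two separate meta tags).
TAG_MAP: Dict[str, Optional[str]] = {
    "programmierung": "programming",
    "bildung": "education",
    "filament": "3d printing",
    "solar": "energy",
    "ml": "machine learning",
    "judith butler": "philosophy",
    "todo": None,
    "unfinished": None,
}


def consolidate(tags: List[str]) -> List[str]:
    """Lowercase, map and deduplicate tags, keeping first-seen order."""
    result: List[str] = []
    for tag in tags:
        key = tag.strip().lower()
        target = TAG_MAP.get(key, key)  # unmapped tags pass through unchanged
        if target is not None and target not in result:
            result.append(target)
    return result


print(consolidate(["Programmierung", "solar", "energy", "TODO", "python"]))
# ['programming', 'energy', 'python']
```

Context-dependent entries such as `environment` or `physics` do not fit a plain lookup and would need a manual decision per file.
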
## Top 30 Most Used Tags (After Consolidation)

1. university of the arts berlin (26)
2. 3d printing (23)
3. work (17)
4. private (17)
5. experiment (16)
6. university of osnabrück (14)
7. studio d+c (14)
8. democratic (14)
9. recycling (12)
10. engineering (12)
11. python (10)
12. interactive (10)
13. energy (10)
14. sustainability (8)
15. media theory (8)
16. making (8)
17. education (8)
18. einszwovier (7)
19. urban intervention (6)
20. unity (6)
21. thesis (6)
22. research (6)
23. radio (6)
24. programming (6)
25. plastics (6)
26. india (6)
27. decentralized (6)
28. technische universität berlin (4)
29. suv (4)
30. stable diffusion (4)

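A ranking like the one above can be produced by counting tag occurrences across files. The sketch below assumes the tags have already been read into a dictionary of file name to tag list. How `scripts/analyze_tags.sh` actually extracts them is not shown in this summary, and the file names and data here are made up.

```python
from collections import Counter
from typing import Dict, List, Tuple

# Hypothetical input: tag lists per file, already extracted.
tags_by_file: Dict[str, List[str]] = {
    "a.md": ["3d printing", "recycling", "work"],
    "b.md": ["3d printing", "energy"],
    "c.md": ["3d printing", "recycling"],
}


def top_tags(files: Dict[str, List[str]], n: int) -> List[Tuple[str, int]]:
    """Count every tag occurrence and return the n most frequent."""
    counts = Counter(tag for tags in files.values() for tag in tags)
    return counts.most_common(n)


for rank, (tag, uses) in enumerate(top_tags(tags_by_file, 2), start=1):
    print(f"{rank}. {tag} ({uses})")
# 1. 3d printing (3)
# 2. recycling (2)
```
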
## Recommendations for Further Reduction

To get closer to 50 tags, consider:

1. **Institution tags**: Keep only the main institutions and remove specific departments
   - Keep: `university of the arts berlin`, `university of osnabrück`
   - Consider generalizing project-specific ones

2. **Tool-specific tags**: Consolidate specific tools into broader categories
   - `google dialogflow`, `google cloud`, `google assistant` → `voice assistant` or `ai`
   - `tensorflow`, `keras` → `machine learning`

3. **Material tags**: Group related materials
   - `plastics`, `clay`, `iron` → `materials`

4. **Specific technologies**: Keep only if frequently used
   - Consider removing one-off tools/libraries

5. **Geographic specificity**:
   - `india`, `himalaya`, `iit kharagpur` → consider just `india`

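Candidates for further merging can be found by listing tags that fall below a usage threshold. The sketch below uses the "3+ times" threshold from this document's tag guidelines. `rare_tags` is a hypothetical helper, and apart from `india` (6) and `python` (10) the counts are invented for illustration.

```python
from collections import Counter
from typing import List


def rare_tags(counts: Counter, min_uses: int = 3) -> List[str]:
    """Return tags used fewer than min_uses times, in alphabetical order."""
    return sorted(tag for tag, uses in counts.items() if uses < min_uses)


# Counts for himalaya, iit kharagpur and keras are made up for this example.
counts = Counter({
    "india": 6,
    "himalaya": 1,
    "iit kharagpur": 1,
    "keras": 2,
    "python": 10,
})

print(rare_tags(counts))
# ['himalaya', 'iit kharagpur', 'keras']
```

The output is only a review list: whether a rare tag is merged into a broader one or kept remains an editorial decision.
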
## Tag Guidelines Going Forward

1. **Always lowercase** - no exceptions
2. **English only** - tags stay in English and are never translated
3. **Use existing tags** - check the list before creating new ones
4. **Prefer general over specific** - unless the specific tag will be used 3+ times
5. **Remove person names** - use topic tags instead
6. **Avoid TODO/meta tags** - those belong in draft status, not tags

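Some of these guidelines can be checked mechanically before a file is committed. The sketch below covers guidelines 1, 3 and 6 only. The English-only and person-name rules need human judgment. `check_tags`, `META_TAGS` and the sample `known` set are hypothetical and not part of the scripts listed in this summary.

```python
from typing import List, Set

# Assumed meta tags, based on the `TODO, unfinished` removal described earlier.
META_TAGS: Set[str] = {"todo", "unfinished"}


def check_tags(tags: List[str], known_tags: Set[str]) -> List[str]:
    """Return human-readable guideline violations for a list of tags."""
    problems: List[str] = []
    for tag in tags:
        lowered = tag.lower()
        if tag != lowered:
            problems.append(f"{tag!r}: not lowercase")
        if lowered in META_TAGS:
            problems.append(f"{tag!r}: meta tag, use draft status instead")
        elif lowered not in known_tags:
            problems.append(f"{tag!r}: not in the existing tag list")
    return problems


known = {"ai", "energy", "3d printing"}
for problem in check_tags(["AI", "TODO", "solar panels", "energy"], known):
    print(problem)
```

A tag missing from the existing list is reported rather than rejected, since guideline 3 asks authors to check the list first, not to never add a tag.
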
## Scripts Created

- `scripts/analyze_tags.sh` - Analyze and count tag usage
- `scripts/consolidate_tags.py` - Main consolidation script
- `scripts/final_tag_cleanup.py` - Final cleanup pass
- `scripts/tag_consolidation_map.txt` - Reference mapping document