When I began this project we had 3,622 tags, of which more than two-thirds (2,666) were singletons. In sorting through our existing list to see what’s what, I have already pared it down to about 3,560.
I make a living working with databases and am somewhat fanatical about having a clean dataset. The banes of my existence are duplicates and ambiguous or incomplete information. Keeping database size down is also a concern, since size can affect speed and performance. The first phase of the Tag project will be to clean up duplicates and ambiguous info. In the second phase, we’ll look at the singletons and see where we can consolidate tags to keep the numbers and size down.
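For the curious, here is a rough sketch of what those two phases might look like in code. It is illustrative Python only, not the tool used for this project: the function names and the sample tags are made up, and it assumes that "duplicate" means tags that differ only in capitalization or spacing.

```python
from collections import defaultdict


def normalize(tag: str) -> str:
    """Collapse case and whitespace so near-identical tags compare equal."""
    return " ".join(tag.lower().split())


def find_duplicate_groups(tags):
    """Group tag names that normalize to the same key; keep groups of 2+."""
    groups = defaultdict(list)
    for tag in tags:
        groups[normalize(tag)].append(tag)
    return {key: names for key, names in groups.items() if len(names) > 1}


def find_singletons(tag_counts):
    """Return tags used on exactly one post, sorted by name."""
    return sorted(tag for tag, count in tag_counts.items() if count == 1)


# Hypothetical sample data: tag name -> number of posts using it
tag_counts = {"Cats": 12, "cats": 3, "cats ": 1, "Road Trip": 1, "recipes": 7}

duplicates = find_duplicate_groups(tag_counts)  # phase one: candidates to merge
singletons = find_singletons(tag_counts)        # phase two: candidates to consolidate
```

Phase one would review each group in `duplicates` and merge it into a single tag; phase two would then go through `singletons` after the merge, since some tags stop being singletons once their duplicates are folded in.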
Before we go any further, meet our Official Tag Team mascots: