Tagging update
Jul. 12th, 2022 10:39 pmAccording to the formulas in my master spreadsheet, I've sorted 6.35% of the groups so far. (Of course, this only counts groups in complete tabs; I haven't entered group numbers for incomplete tabs yet, so groups sorted out onto all of the many smaller languages' tabs aren't factored into that.)
There are only a couple volunteers so far (I'll wait to advertise widely until I've gotten most of the sorting done, I think), but they're doing almost a beta test of the tagging system, catching poorly-thought-out or missing tags in the nonfandom tag list, and helping to establish precedents. They will prove invaluable as later volunteers come on board, since they're familiar with the process and can answer many questions at this point. So far they've tagged 0.33% of the total groups, or 3,703 groups. (Once again, only complete tabs count, so the dozen or so minority language groups I tagged on "miscellaneous" tabs aren't included in that percentage.)
I already have tabs available in a good number of languages: English and Italian, of course, but also Spanish, Portuguese, Chinese, Indonesian, Arabic, Persian, Turkish, and Romanian. And there are a good number of "Unknown" and a couple Spam tabs. (The former almost certainly will need looking into the mbox files in order to tag either language or content or both; the latter is obvious spam and will probably be tagged as such.)
I've also started tabs in 42 other languages, from Afrikaans to Finnish to Greek to Macedonian. Obscure/minority languages on my "miscellaneous tabs" are, so far:
African - Ukwuani
Asian - Zo, Zotung, Tausug, Falam, Tedim, Mizo, Hakha, Tetun Dili
European - Breton
(I've been able to tag the majority of these groups with the help of a volunteer and some diligent searching online, but the Tausug and Tedim are stumping me, and I had to guess a little with the Ukwuani and Zotung descriptions. Breton, at least, I have someone I can ask for a translation, and if I didn't there are sufficient online resources.)
If a language seems likely to have more than one or two groups, it gets its own tab, even if it's a minority language. Languages such as Sundanese and Catalan are in this situation. They'll never have a full tab's worth, but I'd rather keep the handful of groups together.
Most of the tabs available are in business/finance or computer/Internet areas (unless you want to tag Italian groups, and then I have pretty much everything available), but some tabs (such as the ones from the cyberculture category) have a higher percentage of fandom in them. The category I'm currently working on (/Computers & Internet/Other/) was, I suspect, used by Yahoo to dump a whole horde of early groups from another list service (onelist or egroups) whether they belonged there or not, and as such has a high percentage of fandom, to the point that I'm pretty sure I'll be able to offer a tab of nothing but Backstreet Boys, and another tab of nothing but Britney Spears. XD
There are only a couple volunteers so far (I'll wait to advertise widely until I've gotten most of the sorting done, I think), but they're doing almost a beta test of the tagging system, catching poorly-thought-out or missing tags in the nonfandom tag list, and helping to establish precedents. They will prove invaluable as later volunteers come on board, since they're familiar with the process and can answer many questions at this point. So far they've tagged 0.33% of the total groups, or 3,703 groups. (Once again, only complete tabs count, so the dozen or so minority language groups I tagged on "miscellaneous" tabs aren't included in that percentage.)
I already have tabs available in a good number of languages: English and Italian, of course, but also Spanish, Portuguese, Chinese, Indonesian, Arabic, Persian, Turkish, and Romanian. And there are a good number of "Unknown" and a couple Spam tabs. (The former almost certainly will need looking into the mbox files in order to tag either language or content or both; the latter is obvious spam and will probably be tagged as such.)
I've also started tabs in 42 other languages, from Afrikaans to Finnish to Greek to Macedonian. Obscure/minority languages on my "miscellaneous tabs" are, so far:
African - Ukwuani
Asian - Zo, Zotung, Tausug, Falam, Tedim, Mizo, Hakha, Tetun Dili
European - Breton
(I've been able to tag the majority of these groups with the help of a volunteer and some diligent searching online, but the Tausug and Tedim are stumping me, and I had to guess a little with the Ukwuani and Zotung descriptions. Breton, at least, I have someone I can ask for a translation, and if I didn't there are sufficient online resources.)
If a language seems likely to have more than one or two groups, it gets its own tab, even if it's a minority language. Languages such as Sundanese and Catalan are in this situation. They'll never have a full tab's worth, but I'd rather keep the handful of groups together.
Most of the tabs available are in business/finance or computer/Internet areas (unless you want to tag Italian groups, and then I have pretty much everything available), but some tabs (such as the ones from the cyberculture category) have a higher percentage of fandom in them. The category I'm currently working on (/Computers & Internet/Other/) was, I suspect, used by Yahoo to dump a whole horde of early groups from another list service (onelist or egroups) whether they belonged there or not, and as such has a high percentage of fandom, to the point that I'm pretty sure I'll be able to offer a tab of nothing but Backstreet Boys, and another tab of nothing but Britney Spears. XD