Wikispecies:Bots/Requests for approval/MonoBot
- The following discussion is preserved as an archive. Please do not modify it. Subsequent comments should be made on the appropriate discussion page. No further edits should be made to this discussion.
- Operator: Mønobi
- Automatic or Manually Assisted: auto
- Programming Language(s): python (using pywikipedia)
- Function Summary: auto create pages listed on User:MonoBot/Requested Articles
- Edit period(s) (e.g. Continuous, daily, one time run): on a crontab (few times a day to process the articles)
- Edit rate requested: maxlag = 5
- Already has a bot flag (Y/N): No
- Function Details: Grabs data from ITIS, spits it back out in wikispecies format ;)
Discussion
I had MonoBot create the following pages:
Other than the annoying line break between the template and the {{genus}} template, what else needs to be added (please feel free to change the articles directly so I can see what needs to be done :) )? Mønobi 02:03, 12 March 2008 (UTC)[reply]
- I changed Etheostoma asprigene to reflect the changes to be done:
- Remove line break between genus and species.
- Add a space after species:
- Put Reference section before Vernacular Names section.
- Add Name section.
- Add author name if possible at the name section.
- Mariusm 05:41, 12 March 2008 (UTC)[reply]
- There should be a space between colon and the species name. [1] Done
- A bullet is needed in front of reference [2] Done
- Species name should be short-formed (e.g. on the page Etheostoma acuticeps, the species name should be written as E. acuticeps, note that this is not the page name, merely the species name inside the article) [3] not sure per Mariusm's comment below
- Add interwiki links if possible. [4] Working on it...
- OhanaUnited, I thought we resolved, after the discussion at the Village Pump, to write the species name in the long form rather than the short one. Bear in mind that 95% of pages are currently written in the long form.
- We should prefer
- Species: Homo sapiens
- Only where a list is involved like:
- Subspecies: † H. s. idaltu - H. s. sapiens
- There should be the shorter form!
- That's why I said it's ambiguous to have some in short form and other time in long form. It's "all or nothing" approach, no gray areas. OhanaUnitedTalk page 13:52, 12 March 2008 (UTC)[reply]
- My bad on not spelling out Etheostoma for every species on the genus page. My only caution is that the file would be so much larger if Etheostoma was completely spelled out for all 90 species. There are other genuses with even more species. Sure takes up a lot of room! Pvmoutside 22:37, 12 March 2008 (UTC)[reply]
- This problem has been discussed several times before without any satisfactory result. My personal opinion is to have the species, subspecies etc. in short form. If someone is confused by Subspecies: † H. s. idaltu, all they need to do is look at the page name, the Species line, the subspecies line, the name section, and (on some) the image caption. With the full name written out in up to 4 other spots on the page, it doesn't need to be written in full in the Species/subspecies line of the Taxonavigation section. Kevmin 22:29, 14 March 2008 (UTC)[reply]
Question: Can you generate pages starting from an extensive excel file? Lycaon 19:51, 12 March 2008 (UTC)[reply]
I created four more pages:
Other than adding the interwiki link, possibly {{commons}} and an image, does anything else need to be done? Also, Lycaon, I'm not sure if I can use excel files. My bot is on my linux machine and it's run purely from a command line. If there is a way to extract that data through some sort of feed or place it into a .txt file, then I could. Mønobi 20:49, 12 March 2008 (UTC)[reply]
- Wiki-link the author name. OhanaUnitedTalk page 21:42, 12 March 2008 (UTC)[reply]
- What if the author (page) doesn't exist? Still link? Mønobi 22:17, 12 March 2008 (UTC)[reply]
- Draft bot looks awesome Monobi. Will be nice not to do it all by hand! 2 suggestions: 1) adding synonyms where available and 2) capitalizing both names in the English vernacular (i.e. Rainbow Darter rather than Rainbow darter or rainbow darter). Pvmoutside 22:37, 12 March 2008 (UTC)[reply]
- Working on synonyms, added Capitalization. Anything else :) Mønobi 02:17, 13 March 2008 (UTC)[reply]
- Monobi: Yes, even if the author page is a red link, still link. Now I don't know how often it links to an author with the same last name but different person. (e.g. 2 authors share the same last name. However, one guy is from 1700s and another is from 1900s. I encountered a few times in Archaea where I linked to the 1700s author when Archaea wasn't formed until 1977) Another thing, don't worry too much on VN or interwiki, VN bot can take care of that. OhanaUnitedTalk page 02:44, 13 March 2008 (UTC)[reply]
- And seriously, apply for a bot flag or it will clog up the recent changes page. It's not a bot until it makes it onto this list.[5] OhanaUnitedTalk page 02:55, 13 March 2008 (UTC)[reply]
- I was wanting to wait until we had consensus to flag it. :) Mønobi 03:05, 13 March 2008 (UTC)[reply]
- "Vernacular names" vs. "Vernacular Names". From the examples I've seen (Panthera tigris, Help:Vernacular names section), "names" is lowercase in the section heading.
- I think you caught this, but bot was adding References section before Name section. Section order should be: Name, References, Vernacular names.
- "Rainbow darter" vs "Rainbow Darter". I would leave it as "Rainbow darter" (see more on this below)
- AndrewT 01:15, 14 March 2008 (UTC)[reply]
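The section order requested above (Name, References, Vernacular names) could be enforced with something like the following. This is only a minimal sketch, not MonoBot's actual code: `SECTION_ORDER` and `assemble_sections` are hypothetical names, the `==Heading==` form is taken from the examples in this discussion, and skipping absent sections is an assumption.

```python
# Hypothetical helper; the section names and their order come from the comment above.
SECTION_ORDER = ["Name", "References", "Vernacular names"]


def assemble_sections(sections):
    """Join section bodies in the agreed order, skipping absent or empty ones."""
    parts = []
    for title in SECTION_ORDER:
        body = sections.get(title, "").strip()
        if body:
            parts.append(f"=={title}==\n{body}")
    return "\n\n".join(parts)


# The input order does not matter; the output always follows SECTION_ORDER.
page = assemble_sections({
    "References": "* {{ITIS|...}}",
    "Name": "Etheostoma asprigene",
})
print(page)
```

Keeping the order in one list means a later change to the agreed layout only touches that list.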
- Done
- Yeah, I caught that :)
- I'll wait till we see what everyone wants on the below thread :)
- Anything else anyone sees? Mønobi 02:39, 14 March 2008 (UTC)[reply]
- Great work on this bot Mønobi. I wonder if the bot should be aware of ITIS's "Taxonomic Status" (valid/accepted vs. invalid/not accepted) and/or "Data Quality Indicators". For example, if a species is requested that is "invalid" on ITIS, for the sake of raising the standards here, should Monobot refuse to create the page? What about if a "valid" synonym is available - should it check for a page under that name instead? (And a small nit: Should there be no space between the "==" and section heading name? - such as: "== References ==". I know this only saves 2 bytes per section but assuming at least two sections per page, that's a 4MB (uncompressed) savings on 1 million pages ;-) --Georgeryp 04:13, 14 March 2008 (UTC)[reply]
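The byte-savings estimate above can be checked with a few lines of Python. This is just the arithmetic from the comment written out; the variable names are illustrative, and the figures of two sections per page and one million pages are the assumptions stated in the comment.

```python
# Compare a heading with and without the inner spaces.
heading_spaced = "== References =="
heading_tight = "==References=="

bytes_saved_per_heading = (
    len(heading_spaced.encode("utf-8")) - len(heading_tight.encode("utf-8"))
)

sections_per_page = 2   # assumption from the comment above
pages = 1_000_000       # assumption from the comment above

total_bytes_saved = bytes_saved_per_heading * sections_per_page * pages
print(bytes_saved_per_heading, total_bytes_saved)
```

Two bytes per heading across two headings on a million pages gives 4,000,000 bytes, which matches the roughly 4 MB quoted.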
- I'd like to see a couple of things ironed out before the flag is set.
- There should be a space between colon and the species name. [6] Done
- Wikilink authors. [7] Done
- Replace and by & in authors where applicable. [8] Done
- Do not generate ==Vernacular names== tag when no English/Spanish names are available. [9] Done
- We surely don't want to revisit every bot-generated lemma. Lycaon 16:12, 15 March 2008 (UTC)[reply]
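The author formatting asked for in this list (wikilink each author, use & instead of "and") might look roughly like this. It is a sketch only: `format_authors` is a hypothetical helper, the input is assumed to be a plain string of names, and years or parentheses around the authority are deliberately not handled.

```python
import re


def format_authors(authors):
    """Wikilink each author and join the last two with '&'.

    Assumes input such as "Smith and Jones" or "Smith, Jones and Lee".
    Years and parentheses are not handled in this sketch.
    """
    # Split on commas, the standalone word "and", or an existing ampersand.
    names = [n.strip() for n in re.split(r",|\band\b|&", authors) if n.strip()]
    linked = [f"[[{n}]]" for n in names]
    if len(linked) <= 1:
        return "".join(linked)
    return ", ".join(linked[:-1]) + " & " + linked[-1]


print(format_authors("Smith and Jones"))
print(format_authors("Smith, Jones and Lee"))
```

The word boundary in the pattern keeps names that merely contain "and" (Anderson, for instance) intact. Links are produced even when the author page does not exist yet, as requested earlier in the discussion.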
- Anything else? Mønobi 17:08, 15 March 2008 (UTC)[reply]
- Need to put a bullet before {{ITIS|... }} in reference section (this was mentioned above but it's still occurring).
- No one commented on the issue raised above about taxons flagged as invalid on ITIS. At least in some cases of "invalid" taxons, ITIS also gives a valid synonym so I think MonoBot should check the status on ITIS and either refuse to make the taxon or create the valid synonym instead. For example, see:
- Heliactin cornuta – Taxon details on Integrated Taxonomic Information System (ITIS).
- Possible bug wikilinking authors inside parentheses? [10] --Georgeryp 19:14, 15 March 2008 (UTC)[reply]
- Good point raised by Georgeryp, in particular regarding invalid taxons. OhanaUnitedTalk page 19:23, 15 March 2008 (UTC)[reply]
- 1. Yeah, somewhere along the line that got removed. I've re-added it.
- 2. I'll have the bot refuse to create it.
- 3. Fixed.
- Mønobi 19:32, 15 March 2008 (UTC)[reply]
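The refusal to create pages for taxa flagged invalid could be sketched as below. This is an assumption-heavy illustration rather than the bot's real logic: the record dictionaries, the `status` key, and `should_create` are hypothetical, and only the status wording (valid/accepted vs. invalid/not accepted) comes from the discussion. How the status is actually read from ITIS is not shown.

```python
# Status values treated as acceptable, per the wording quoted in the discussion.
VALID_STATUSES = {"valid", "accepted"}


def should_create(record):
    """Return True only when the (hypothetical) record is flagged valid/accepted."""
    status = record.get("status", "").strip().lower()
    return status in VALID_STATUSES


# Placeholder requests; the names are made up for illustration.
requests = [
    {"name": "Example validus", "status": "valid"},
    {"name": "Example invalidus", "status": "invalid"},
    {"name": "Example ignotus"},  # missing status: refused as well
]

to_create = [r["name"] for r in requests if should_create(r)]
print(to_create)
```

Treating a missing status as a refusal is the conservative choice here, in keeping with the concern about not having to revisit bot-generated pages.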
- Generate another 4 articles using the bot. If no problems were found, then I think your bot is good to go. OhanaUnitedTalk page 14:15, 17 March 2008 (UTC)[reply]
- No problems. Go ahead and flag it ? Mønobi 14:22, 21 March 2008 (UTC)[reply]
- Granted EVula // talk // ☯ // 15:41, 21 March 2008 (UTC)[reply]