IRC Bots: Difference between revisions

From Lojban
Jump to navigation Jump to search
No edit summary
 
(7 intermediate revisions by 4 users not shown)
Line 5: Line 5:
==Lojban assistance==
==Lojban assistance==


==='''valsi'''===
==='''vlaste'''===
This is a bidirectional Lojban-English dictionary. It works from a dump of [http://jbovlaste.lojban.org/ Jbovlaste], so it may not be completely up to date.
This is a bidirectional Lojban-English dictionary, run by [[la durka]]. It works from a dump of [http://jbovlaste.lojban.org/ Jbovlaste], so it may not be completely up to date.


Run by [[la donri]] ([https://github.com/lojban/vlasisku/blob/master/vlasisku/irc.py source]).
Run by [[la donri]] ([https://github.com/lojban/vlasisku/blob/master/vlasisku/irc.py source]).
Line 33: Line 33:
:* I don't see this option in the source, so maybe what's running is a slightly different version? [[User:Durka42|Durka42]] ([[User talk:Durka42|talk]])
:* I don't see this option in the source, so maybe what's running is a slightly different version? [[User:Durka42|Durka42]] ([[User talk:Durka42|talk]])
* You can also send queries in vlasisku's internal query language, like "valsi class:BAI" or "valsi affix:ka'i".
* You can also send queries in vlasisku's internal query language, like "valsi class:BAI" or "valsi affix:ka'i".
==='''vlaste'''===
Another instance of '''valsi''', run by [[la durka]], which contains an updated jbovlaste database, bug fixes and new features.
It supports the following additional commands, which '''valsi''' does not (until it is updated):
* lujvo
* lujvo
: Combine a tanru into a lujvo, using [[jvocu'adju]]. If the canonical form is defined, the definition is shown.
: Combine a tanru into a lujvo, using [[jvocu'adju]]. If the canonical form is defined, the definition is shown.
Line 42: Line 38:
* finti
* finti
: Show the [[jbovlaste]] user who created the word.
: Show the [[jbovlaste]] user who created the word.
* gimka
: Shows whether the input gismu collides with any existing official or experimental gismu.


Also, its lujvo-breaking algorithm for ''(components)'' is backed by [[camxes]], so it will call you out on a badly-constructed lujvo (e.g. a [[tosmabru]]), unlike '''valsi'''.
Also, its lujvo-breaking algorithm for ''(components)'' is backed by [[camxes]], so it will call you out on a badly-constructed lujvo (e.g. a [[tosmabru]]), unlike '''valsi'''.
Line 56: Line 54:
There are no options
There are no options
-->
-->
==='''camxes'''===
==='''camxes'''===
A grammar checking bot based on [[camxes]].
A grammar checking bot based on [[camxes]].
Line 71: Line 70:
: Don't show elidible terminators.
: Don't show elidible terminators.
* +exp (or -std)
* +exp (or -std)
: Parse using the experimental grammar.
: Parse using the [[ilmentufa]] experimental grammar.
* -exp (or +std)
* -exp (or +std)
: Parse using the standard grammar (default).
: Parse using the standard [[BPFK]] grammar (default).


==='''mensi''', '''livla'''===
==='''mensi'''===
Kitchen sink bot run by [[la gleki]] ([https://github.com/lagleki/glekitufa/tree/master/ircbot source]).
Kitchen sink bot run by [[la gleki]] ([https://github.com/lagleki/glekitufa/ source]).


Commands recognized by mensi:
Commands recognized by mensi:
*"'''off:'''", "'''exp:'''"
*"'''.ilm '''", "'''.beta '''"
**These are parser options based on [[camxes]]. '''off''' parses regular Lojban while '''exp''' has (most of?) the [[ELG]] implemented.
**These are parser options based on [[camxes]]. '''ilm''' parses regular [[BPFK]] Lojban while '''beta''' uses an experimental grammar of la [[ilmentufa]].
**You send this bot a sentence by prefixing it with the parser name and a colon, e.g. "off: coi". You can also private-message it (the prefixes are also needed there).
**You send this bot a sentence by prefixing it with the parser name and a colon, e.g. ".ilm coi". You can also private-message it (the prefixes are also needed there).
**Accepted options (put them before the sentence, like "off:+s coi"):
**Accepted options (put them before the sentence, like ".ilm+mts coi"):
*** +s
***s - with spaces
**** Show the selma'o of each word.
***m - with morphology
*** -f
 
**** Don't show terminators.
***t - with terminators
*** -f+s
***c - with selmaho
**** Both of the above.
 
***n - with node labels
***j - in JSON format
 
*[[jbofihe]] parser
*[[jbofihe]] parser
**"gerna:" parses using jbofihe, which was the de facto standard parser before camxes.
**".gerna " parses using jbofihe, which was the de facto standard parser before camxes.
*[[John Cowan]]'s parser
*[[John Cowan]]'s parser
**"cowan:" parses using the (very old) [[Official LLG Parser|official LLG parser]], based on the YACC grammar.
**".cowan " parses using the (very old) [[Official LLG Parser|official LLG parser]], based on the YACC grammar.
*Searching jbovlaste with mensi:
*Searching jbovlaste with mensi:
**Entering "en:coi" into the chat will output the English definition, notes and the author.
**Entering ".en coi" into the chat will output the English definition, notes and the author.
*showing etymology with mensi: "krasi:mlatu" will output Lojbanized sources for this gismu.
*showing etymology with mensi: ".krasi mlatu" will output Lojbanized sources for this gismu.
***lujvo are expanded and its structure is shown. If no lujvo is found it searches for other lujvo with the same se rafsi.
***lujvo are expanded and its structure is shown. If no lujvo is found it searches for other lujvo with the same se rafsi.
**Other languages are supported: "jbo:", "ja:" (Japanese), "ru:" (Russian), "fr-facile:" (Easy French), "de:" (German), "es:" (Spanish), "eo:" (Esperanto). The databases are autoupdated once every 3 days.
**Other languages are supported: ".jbo ", ".ja " (Japanese), ".ru " (Russian), ".f@ " (Easy French), ".de " (German), ".es " (Spanish), ".eo " (Esperanto), ".jb " (Lojban dictionary with examples).
**"/full " flag performs a search for a given sequence in definitions and notes. E.g. "en:/full greets".
**"/full " flag performs a search for a given sequence in definitions and notes. E.g. ".en /full greets".
*"Tatoeba:" outputs an arbitrary Lojban sentence from Tatoeba with a substring given
*"Tatoeba:" outputs an arbitrary Lojban sentence from Tatoeba with a substring given
*Alternative orthography with '''mensi''':
*Alternative orthography with '''mensi''':
**"rot13:" prefix performs ROT13 encoding on a given string
**".rot13 " prefix performs ROT13 encoding on a given string
**".kru " prefix performs conversion to la [[krulermorna]] orthography
**"mensi: j " performs [[jbopomofo]] encoding on a given string
**"mensi: j " performs [[jbopomofo]] encoding on a given string
**"mensi: r " performs Cyrillic encoding on a given string
**"mensi: r " performs Cyrillic encoding on a given string
Line 108: Line 111:
*** Delayed messaging service. Next time <nick> says something in the channel, '''mensi''' will relay the message. You can use regular expressions in the nick.
*** Delayed messaging service. Next time <nick> says something in the channel, '''mensi''' will relay the message. You can use regular expressions in the nick.
*Loglan to Lojban conversion
*Loglan to Lojban conversion
** "loi:" converts Lojban text to Loglan (not perfect!). Failed words are marked with "*".
** ".loi " converts Lojban text to Loglan (not perfect!). Failed words are marked with "*".
** "coi:" converts Loglan text to Lojban (not perfect!). Failed words are marked with "*".
** ".coi " converts Loglan text to Lojban (not perfect!). Failed words are marked with "*".
*Glossing
*Glossing
** "gloss:" converts Lojban text to glossed English words.
** ".gloss " converts Lojban text to glossed English words.
*lujvo building
*lujvo building
** "lujvo:" combines a given tanru into a list of lujvo and cmevla with their scores, using a method similar to [[jvocuhadju]]. The first (with the lowest score) lujvo are preferable in speech.
** ".lujvo " combines a given tanru into a list of lujvo and cmevla with their scores, using a method similar to [[jvocuhadju]]. The first (with the lowest score) lujvo are preferable in speech.


In the channel ##jboselbau, '''livla''' repeats sentences from [[Tatoeba]] every few minutes. If you catch any mistakes, you should probably tell someone, since it means a Lojban entry on Tatoeba is wrong.
===tersmus===
 
[https://gitlab.com/zugz/tersmu/ la tersmu] converts Lojban sentences into logic formulas. Type "tersmus: " or "tersmux: " followed by a sentence to use it publicly; drop the prefix in private messages. Add "jbo: " before the sentence for Lojbanized output.
===tersmus/tersmux===
 
[https://github.com/lagleki/tersmu-0.2 tersmu] converts Lojban sentences into logic formulas. Type "tersmus: " or "tersmux: " followed by a sentence to use it publicly; drop the prefix in private messages. Add "jbo: " before the sentence for Lojbanized output.


===Experimental grammars===
===Experimental grammars===


Some bots are testbeds for experimental grammar that may or may not find its way into common use. Currently all of them are based on camxes, but not fully compatible with either ''off'' or ''exp'' camxes.
Some bots are testbeds for experimental grammar that may or may not find its way into common use. Currently all of them are based on camxes, but not fully compatible with the official grammar.


* '''[[zantufa]]'''
* '''[[zantufa]]'''
: Extensive reforms to basic sentence structure, connectives, and mekso. Most experimental cmavo listed on jbovlaste are supported. The bot uses the last stable release by default; adding "+exp " before the sentence switches to the unstable version. Adding "+mal " switches to '''maltufa''', a version meant for parsing older texts that lacks many of the reforms. There is detailed [[zantufa|documentation]].
: Extensive reforms to basic sentence structure, connectives, and mekso. Most experimental cmavo listed on jbovlaste are supported. The bot uses the last stable release by default; adding "+exp " before the sentence switches to the unstable version. Adding "+mal " switches to '''maltufa''', a version meant for parsing older texts that lacks many of the reforms. Adding "+maf " switches to '''maftufa''', made to parse [http://selpahi.de/oz.html selpa'i's translation of The Wonderful Wizard of Oz]]. There is detailed [[zantufa|documentation]].


* '''[https://github.com/lagleki/glekitufa alta]''' (part of '''mensi''')
* '''[https://github.com/mezohe/ilmentufa spagetufa]'''
: Attempts to restore omitted parts of a sentence, so that each selma'o introduces only one kind of structure. Connectives influenced by zantufa. Unstable, and development is sporadic.
: Morphology reform, allowing new word shapes while preserving existing ones. Several changes taken from zantufa and alta. [[ce ki tau jau]] available as a runtime switch (send "+help" in private for a list of switches). Development stagnant.


* '''[https://github.com/mezohe/ilmentufa spagetufa]'''
* Another, older experimental grammar ([[ilmentufa]]) is part of both '''camxes''' and '''mensi''' bots. Most of its additions are also supported by the other experimental parsers.
: Morphology reform, allowing new word shapes while preserving existing ones. Several changes taken from zantufa and alta. [[ce ki tau jau]] available as a runtime switch (send "+help" in private for a list of switches). Unstable.


==Other==
==Other==
* '''sidju'''
* '''nuzba'''
: This is an instance of the Phenny IRC bot.
: Searches Twitter for posts containing the words "lojban", "jbobau", or "ロジバン", and posts them whenever the channel has been quiet for two minutes. Takes no commands.
: Run by [[Tene]] ([https://github.com/sbp/phenny source]).
* '''djisku'''
: Common commands (see the source for full documentation):
: Monitors the channel for new users. If a newbie doesn't get an answer within one minute of first talking, it posts a message asking for patience. You can have a PM sent to you whenever a newbie starts talking by saying "djisku: ko mi fanza". Unsubscribe with "djisku: ko mi na fanza".
** "sidju: tell <nick> <message>"
:: Delayed messaging service. Next time <nick> says something in the channel, '''sidju''' will relay the message. You can use * in the nick as a wildcard.
** Google/Wikipedia search: ".g <query>" or ".wik <query>"
:: Searches and returns a link to the first result.
** Link title lookup: ".title"
:: Opens the most recent link pasted in the channel, and retrieves its title.
* '''xorban'''
* '''xorban'''
: Xorban is a parser bot for another logical language (you guessed it, called Xorban). It's actually the same bot as '''camxes''' but with a different grammar.
: Xorban is a parser bot for another logical language (you guessed it, called Xorban). It's actually the same bot as '''camxes''' but with a different grammar.
Line 150: Line 143:
* '''stetudu'''
* '''stetudu'''
: An instance of [[User:Leslie Beaumont|Randall Holmes]]'s [http://math.boisestate.edu/~holmes/loglan.org/holmes_stuff/cefli.html parser] for [[Loglan]]. Takes a Loglan sentence prefixed with "stetudu: " in the channel or without a prefix in private messages. Add "+s " before the sentence to show rule names.
: An instance of [[User:Leslie Beaumont|Randall Holmes]]'s [http://math.boisestate.edu/~holmes/loglan.org/holmes_stuff/cefli.html parser] for [[Loglan]]. Takes a Loglan sentence prefixed with "stetudu: " in the channel or without a prefix in private messages. Add "+s " before the sentence to show rule names.
* '''djisku'''
* '''gua'''
: Monitors the channel for new users. If a newbie doesn't get an answer within one minute of first talking, it posts a message asking for patience. You can have a PM sent to you whenever a newbie starts talking by saying "djisku: ko mi fanza". Unsubscribe with "djisku: ko mi na fanza".
: Parses [[Gua\spi]], yet another logical language, using an incomplete grammar reconstructed by [[User:Cirko|cirko]]
* '''vreji'''
* '''vreji'''
: Logs Lojban utterances in the channel at http://www.lojban.org/irclogs/.
: Logs Lojban utterances in the channel at http://www.lojban.org/irclogs/.
: Run by a pile of Perl scripts.
: Run by a pile of Perl scripts.
: Does not respond to any commands, as far as I know.
: Does not respond to any commands, as far as I know.

Latest revision as of 18:27, 21 July 2017

A Guide to the IRC Bots of #lojban (also #ckule and ##jboselbau).

You might think all ~120 logged on users of #lojban are watching you and/or participating in the conversation. Well, no, we all live in different timezones and at any given moment of time most of them are inactive. But also, there are a bunch of robots! Some of them are even helpful. Read on for more.

Lojban assistance

vlaste

This is a bidirectional Lojban-English dictionary, run by la durka. It works from a dump of Jbovlaste, so it may not be completely up to date.

Run by la donri (source).

You can invoke valsi in the channel by prefixing a message with "valsi ", or send a private message to the bot (with no prefix).

The content of the message is just a word. If it's a Lojban word, you'll get its definition. If it's an English word, you'll get a list of suggested translations.

Accepted options for Lojban word lookup: (you can give up to one option, in parentheses, e.g. "valsi lercu'aca'a (components)")

  • affix
Get the affix (rafsi) form of the word.
  • class
Get the selma'o (grammatical category) of the word.
  • type
Get the type (cmavo/gismu/lujvo/etc) of the word.
  • notes
Return the notes field of the definition (without this option it isn't shown).
  • cll
Get the link to the section of the CLL that discusses the word (not always accurate).
  • url
Get the link to Vlasisku's page for the word.
  • components
Break a lujvo into its component gismu (works on nonce lujvo). Beware: lujvo with missing hyphens are still parsed, so you can't rely on this to check your construction.
  • rafsi
Treat the word as a rafsi for the purposes of the search (many cmavo are also rafsi of other words).
  • I don't see this option in the source, so maybe what's running is a slightly different version? Durka42 (talk)
  • You can also send queries in vlasisku's internal query language, like "valsi class:BAI" or "valsi affix:ka'i".
  • lujvo
Combine a tanru into a lujvo, using jvocu'adju. If the canonical form is defined, the definition is shown.
Relatedly, if you ask vlaste for a lujvo not in canonical form, and no definition is found, it will try to help you by silently correcting it to canonical form.
  • finti
Show the jbovlaste user who created the word.
  • gimka
Shows whether the input gismu collides with any existing official or experimental gismu.

Also, its lujvo-breaking algorithm for (components) is backed by camxes, so it will call you out on a badly-constructed lujvo (e.g. a tosmabru), unlike valsi.


camxes

A grammar checking bot based on camxes.

Run by la durka (source).

You can invoke camxes in the channel by prefix a message with "camxes: ", or send a private message to the bot (with no prefix).

The message is just a Lojban sentence. If it parses, you'll get the structure. If it doesn't parse, it'll try to tell you where you went wrong.

Accepted flags for parsing: (put them before the sentence, like "camxes: +s-f coi"; you can put any number of flags in any order)

  • +s
Show the selma'o of each word in the sentence.
  • -f
Don't show elidible terminators.
  • +exp (or -std)
Parse using the ilmentufa experimental grammar.
  • -exp (or +std)
Parse using the standard BPFK grammar (default).

mensi

Kitchen sink bot run by la gleki (source).

Commands recognized by mensi:

  • ".ilm ", ".beta "
    • These are parser options based on camxes. ilm parses regular BPFK Lojban while beta uses an experimental grammar of la ilmentufa.
    • You send this bot a sentence by prefixing it with the parser name and a colon, e.g. ".ilm coi". You can also private-message it (the prefixes are also needed there).
    • Accepted options (put them before the sentence, like ".ilm+mts coi"):
      • s - with spaces
      • m - with morphology
      • t - with terminators
      • c - with selmaho
      • n - with node labels
      • j - in JSON format
  • jbofihe parser
    • ".gerna " parses using jbofihe, which was the de facto standard parser before camxes.
  • John Cowan's parser
  • Searching jbovlaste with mensi:
    • Entering ".en coi" into the chat will output the English definition, notes and the author.
  • showing etymology with mensi: ".krasi mlatu" will output Lojbanized sources for this gismu.
      • lujvo are expanded and its structure is shown. If no lujvo is found it searches for other lujvo with the same se rafsi.
    • Other languages are supported: ".jbo ", ".ja " (Japanese), ".ru " (Russian), ".f@ " (Easy French), ".de " (German), ".es " (Spanish), ".eo " (Esperanto), ".jb " (Lojban dictionary with examples).
    • "/full " flag performs a search for a given sequence in definitions and notes. E.g. ".en /full greets".
  • "Tatoeba:" outputs an arbitrary Lojban sentence from Tatoeba with a substring given
  • Alternative orthography with mensi:
    • ".rot13 " prefix performs ROT13 encoding on a given string
    • ".kru " prefix performs conversion to la krulermorna orthography
    • "mensi: j " performs jbopomofo encoding on a given string
    • "mensi: r " performs Cyrillic encoding on a given string
  • Messaging
    • "mensi: doi <nick> <message>"
      • Delayed messaging service. Next time <nick> says something in the channel, mensi will relay the message. You can use regular expressions in the nick.
  • Loglan to Lojban conversion
    • ".loi " converts Lojban text to Loglan (not perfect!). Failed words are marked with "*".
    • ".coi " converts Loglan text to Lojban (not perfect!). Failed words are marked with "*".
  • Glossing
    • ".gloss " converts Lojban text to glossed English words.
  • lujvo building
    • ".lujvo " combines a given tanru into a list of lujvo and cmevla with their scores, using a method similar to jvocuhadju. The first (with the lowest score) lujvo are preferable in speech.

tersmus

la tersmu converts Lojban sentences into logic formulas. Type "tersmus: " or "tersmux: " followed by a sentence to use it publicly; drop the prefix in private messages. Add "jbo: " before the sentence for Lojbanized output.

Experimental grammars

Some bots are testbeds for experimental grammar that may or may not find its way into common use. Currently all of them are based on camxes, but not fully compatible with the official grammar.

Extensive reforms to basic sentence structure, connectives, and mekso. Most experimental cmavo listed on jbovlaste are supported. The bot uses the last stable release by default; adding "+exp " before the sentence switches to the unstable version. Adding "+mal " switches to maltufa, a version meant for parsing older texts that lacks many of the reforms. Adding "+maf " switches to maftufa, made to parse selpa'i's translation of The Wonderful Wizard of Oz]. There is detailed documentation.
Morphology reform, allowing new word shapes while preserving existing ones. Several changes taken from zantufa and alta. ce ki tau jau available as a runtime switch (send "+help" in private for a list of switches). Development stagnant.
  • Another, older experimental grammar (ilmentufa) is part of both camxes and mensi bots. Most of its additions are also supported by the other experimental parsers.

Other

  • nuzba
Searches Twitter for posts containing the words "lojban", "jbobau", or "ロジバン", and posts them whenever the channel has been quiet for two minutes. Takes no commands.
  • djisku
Monitors the channel for new users. If a newbie doesn't get an answer within one minute of first talking, it posts a message asking for patience. You can have a PM sent to you whenever a newbie starts talking by saying "djisku: ko mi fanza". Unsubscribe with "djisku: ko mi na fanza".
  • xorban
Xorban is a parser bot for another logical language (you guessed it, called Xorban). It's actually the same bot as camxes but with a different grammar.
Run by la selpa'i.
  • stetudu
An instance of Randall Holmes's parser for Loglan. Takes a Loglan sentence prefixed with "stetudu: " in the channel or without a prefix in private messages. Add "+s " before the sentence to show rule names.
  • gua
Parses Gua\spi, yet another logical language, using an incomplete grammar reconstructed by cirko
  • vreji
Logs Lojban utterances in the channel at http://www.lojban.org/irclogs/.
Run by a pile of Perl scripts.
Does not respond to any commands, as far as I know.