"We've Got to Think About the Game"
I said I'd go into more detail about each of the activities in the Cyc Foundation's Cyclify initiative. The purpose that cuts across all activities is to grow the Cyc Knowledge Base. Initially, the focus is on what we call breadth or coverage. We want everything one could think of to have a concept term or a way of forming one from functional terms. A concept term is something like #$CreditCard. A term built functionally is something like (#$BorderBetweenFn #$France #$Germany). The '#$' is something used internally by Cyc to distinguish Cyc terms from other kinds of symbols. If you're not doing programming, you don't have to use it. But sometimes it helps as a way of knowing that you're talking about #$France the Cyc term as opposed to "France" the string or France the country.
Anyway, we want to fill up Cyc with concepts and with useful facts about the concepts. As a result of the way development proceeded with fits and starts based on where funding was coming from, the coverage in Cyc is very broad but spotty. There may be terms for #$Parsley, #$Sage and #$Rosemary, but #$Thyme might be missing. (Sounds like a song.) As for facts, Cyc may know that yelling is louder than talking, but not know that talking is typically louder than whispering.
The games will be one way of helping to fill in the content. The first game presents the player with statements for which she must choose True, False or Skip. Since many of the questions are common sense things that we all know, challenge is introduced in the form of time pressure. The first game will also have other question types; for example, one type allows the player to choose several answers.
The game is fed by statements that you might call prospective facts. There are several ways to come up with these. One is to have Cyc use natural language generation to create partial search strings -- "The adult elephant can weigh as much as *" -- and parse what you get back. Another is to use abduction -- find things that might possibly be true and propose that they are true. The statements that feed the game are in what we call "the pipeline".
There are issues in creating the pipeline. We want to create useful prospective facts; and we also want to be able to categorize them, so that you as a player won't keep getting shown statements that you know nothing about. That's a challenge we need to address presently.
I'll continue with more about the game next time.
P.S. Bonus points for whoever can tell what musical the title of this post came from.