AutoMan API Reference
Supported Question Types
Question Type
Purpose
Quality-Controlled
Number of Answers Returned
radio
The user is asked to choose one of n options.
yes
1
checkbox
The user is asked to choose one of m of n options, where m <= n.
yes
1
freetext
The user is asked to enter a textual response, such that the response conforms to a simple pattern (a "picture clause").
yes
1
estimate
The user is asked to enter a numeric (real-valued) response.
yes
1
radios
Same as radio, except that it returns the entire distribution.
no
sample size
checkboxes
Same as checkbox, except that it returns the entire distribution.
no
sample size
freetexts
Same as freetext, except that it returns the entire distribution.
no
sample size
Calling a Question Type
We describe question type signatures below. It is important to note that calling a question type constructor immediately launches a crowdsoucing task. This is not usually what you want, which is why all of our sample applications utilize the following pattern:
def my_function(<arg>, ...) = <AutoMan constructor>(<configuration>)For example, here is a sample human function for calorie counting:
def howManyCals(imgUrl: String) = estimate (
budget = 6.00,
confidence_interval = SymmetricCI(50),
text = "Estimate how many calories (kcal) are " +
"present in the picture shown in the photo.",
image_url = imgUrl,
min_value = 0
)Observe how we use a Scala user-defined function (def) to pass the imgUrl parameter through to the estimate constructor. See our sample apps for additional examples.
Question Return Types
Another thing to note is that all AutoMan question constructors return a result belonging to the supertype Outcome. Although you can call toString on such return values to obtain a simple, printable string, you should probably pattern-match on the result value returned by calling answer (or answers, depending on the question) on the returned Outcome object. Each question type has a different set of possible return values. We describe them in the next section.
You are encouraged to look at the sample apps for examples.
The toString method for Outcome calls answer internally, which means that it blocks!
Question Type Constructor Signatures
We provide question constructors here. Note that all of them take a very large number of parameters, but that most of those parameters are the same between question types and nearly all of them have "sane defaults." Defaults are managed by requiring the use of named arguments.
Therefore, we provide two constructor signatures for each question type: the 1) simplified constructor showing only mandatory parameters, and 2) the full ScalaDoc-generated constructor with all parameters. We also describe common parameters at the end.
We describe the variants used in the mturk DSL here.
Radio Button Questions
The following constructor parameters are mandatory:
def radio(
options: List[MTQuestionOption],
text: String
(implicit a: A): ScalarOutcome[Symbol] optionsare the selection options seen by the user, along with optional images. Options can be created using one of the followingchoiceconstructors:choice(key: Symbol, text: String)orchoice(key: Symbol, text: String, image_url: String)
where
keydenotes a stable identifier for a choice (e.g.,kermit) not shown to the worker,textis the text label shown to the worker, andimage_urlis a url of an image shown beside the text label.textis the text of the question shown in aHITand, by default, also as the task title. You can override the title by setting thetitleparameter.
Radio button questions can return the following values:
Answer[Symbol]: An object that represents a selected radio button, where each possibleSymbolwas defined with thekeyparameter of thechoiceconstructors described above. This object has the following fields:value: the answer (Symbol).cost: the total cost (BigDecimal)confidence: the final confidence value (Double).distribution: raw sample responses (Array[Response[Symbol]])
LowConfidenceAnswer, which has the same fields asAnswerbut which indicates that a quality-controlled response has aconfidencelower than the desired threshold.OverBudgetAnswer, which indicates that a specified task cannot run at all due to insufficient funds. This object has the following fields:need: the funds needed to start a job (BigDecimal)have: the funds at hand (BigDecimal)
NoAnswer, which indicates that an unexpected runtime error occurred.
The following is a ScalaDoc-generated signature:
def radio[A <: AutomanAdapter, O](
confidence: Double = MagicNumbers.DefaultConfidence,
budget: BigDecimal = MagicNumbers.DefaultBudget,
dont_reject: Boolean = true,
dry_run: Boolean = false,
image_alt_text: String = null,
image_url: String = null,
initial_worker_timeout_in_s: Int = ...,
minimum_spawn_policy: MinimumSpawnPolicy = null,
mock_answers: Iterable[MockAnswer[Symbol]] = null,
options: List[AnyRef],
pay_all_on_failure: Boolean = true,
question_timeout_multiplier: Double = ...,
text: String,
title: String = null,
wage: BigDecimal = MagicNumbers.USFederalMinimumWage)
: ScalarOutcome[Symbol] Checkbox Questions
The following constructor parameters are mandatory:
def checkbox(
options: List[MTQuestionOption],
text: String)
: ScalarOutcome[Set[Symbol]] optionsare the selection options seen by the user, along with optional images. Options can be created using one of the followingchoiceconstructors:choice(key: Symbol, text: String)orchoice(key: Symbol, text: String, image_url: String)
where
keydenotes a stable identifier for a choice (e.g.,kermit) not shown to the worker,textis the text label shown to the worker, andimage_urlis a url of an image shown beside the text label.textis the text of the question shown in aHITand, by default, also as the task title. You can override the title by setting thetitleparameter.
Checkbox questions can return the following values:
Answer[Set[Symbol]]: An object that represents a set of selected checkboxes, where eachSymbolwas defined with thekeyparameter of thechoiceconstructors described above. This object has the following fields:value: the answer (Set[Symbol]).cost: the total cost (BigDecimal)confidence: the final confidence value (Double).distribution: raw sample responses (Array[Response[Set[Symbol]]])
LowConfidenceAnswer, which has the same fields asAnswerbut which indicates that a quality-controlled response has aconfidencelower than the desired threshold.OverBudgetAnswer, which indicates that a specified task cannot run at all due to insufficient funds. This object has the following fields:need: the funds needed to start a job (BigDecimal)have: the funds at hand (BigDecimal)
NoAnswer, which indicates that an unexpected runtime error occurred.
The following is a ScalaDoc-generated signature:
def checkbox[A <: AutomanAdapter, O](
confidence: Double = MagicNumbers.DefaultConfidence,
budget: BigDecimal = MagicNumbers.DefaultBudget,
dont_reject: Boolean = true,
dry_run: Boolean = false,
image_alt_text: String = null,
image_url: String = null,
initial_worker_timeout_in_s: Int = ...,
minimum_spawn_policy: MinimumSpawnPolicy = null,
mock_answers: Iterable[MockAnswer[Set[Symbol]]] = null,
options: List[AnyRef],
pay_all_on_failure: Boolean = true,
question_timeout_multiplier: Double = ...,
text: String,
title: String = null,
wage: BigDecimal = MagicNumbers.USFederalMinimumWage)
(implicit a: A)
: ScalarOutcome[Set[Symbol]] Free-Text Questions
The following constructor parameters are mandatory:
def freetext(
pattern: String,
text: String)
: ScalarOutcome[String] patternis a COBOL-style picture clause pattern that states what inputs are valid. AutoMan uses this pattern to perform probability calculations.Amatches an alphabetic character,Bmatches an optional alphabetic character,Xmatches an alphanumeric character,Ymatches an optional alphanumeric character,9matches a numeric character, and0matches an optional numeric character. For example, a telephone number recognition application might use the pattern09999999999.textis the text of the question shown in aHITand, by default, also as the task title. You can override the title by setting thetitleparameter.
The following parameters are freetext-specific:
allow_empty_patternmeans that the empty string is a valid worker response. default:falsebefore_filteris not currently used.pattern_error_textis a helpful message that is displayed to the user when their input does not match a pattern. It is not mandatory but it is highly recommended that you use this setting.
Free-text questions can return the following values:
Answer[String]: An object that represents a response string. This object has the following fields:value: the answer (String).cost: the total cost (BigDecimal)confidence: the final confidence value (Double).distribution: raw sample responses (Array[Response[String]])
LowConfidenceAnswer, which has the same fields asAnswerbut which indicates that a quality-controlled response has aconfidencelower than the desired threshold.OverBudgetAnswer, which indicates that a specified task cannot run at all due to insufficient funds. This object has the following fields:need: the funds needed to start a job (BigDecimal)have: the funds at hand (BigDecimal)
NoAnswer, which indicates that an unexpected runtime error occurred.
The following is a ScalaDoc-generated signature:
def freetext[A <: AutomanAdapter](
allow_empty_pattern: Boolean = false,
confidence: Double = MagicNumbers.DefaultConfidence,
before_filter: (String) ⇒ String = (a: String) => a,
budget: BigDecimal = MagicNumbers.DefaultBudget,
dont_reject: Boolean = true,
dry_run: Boolean = false,
image_alt_text: String = null,
image_url: String = null,
initial_worker_timeout_in_s: Int = ...,
minimum_spawn_policy: MinimumSpawnPolicy = null,
mock_answers: Iterable[MockAnswer[String]] = null,
pay_all_on_failure: Boolean = true,
pattern: String,
pattern_error_text: String = null,
question_timeout_multiplier: Double = ...,
text: String,
title: String = null,
wage: BigDecimal = MagicNumbers.USFederalMinimumWage)
(implicit a: A)
: ScalarOutcome[String] Estimates
There is an entire paper (VoxPL) about this one question type.
The following constructor parameters are mandatory:
def estimate(
confidence_interval: ConfidenceInterval,
text: String)
: EstimationOutcome confidence_intervallets you denote the confidence interval of an estimate. The options are:UnconstrainedCI()which will only even perform one round of tasks using the default sample size, returning the median.SymmetricCI(err: Double)which returns the medianerrwithconfidencelevel confidence.AsymmetricCI(lerr: Double, herr: Double)which returns the median of an estimate between-lerrrand+herrwithconfidencelevel confidence.
textis the text of the question shown in aHITand, by default, also as the task title. You can override the title by setting thetitleparameter.
Estimates can return the following values:
Estimate: An object that represents a "best estimate". This object has the following fields:value: the estiamte (Double).low: the low bound of a confidence interval's estimate (Double).high: the high bound of a confidence interval's estimate (Double).cost: the total cost (BigDecimal)confidence: the final confidence value (Double).distribution: raw sample responses (Array[Response[Double]])
LowConfidenceEstimate, which has the same fields asEstimatebut which indicates that a quality-controlled response has aconfidencelower than the desired threshold.OverBudgetEstimate, which indicates that a specified task cannot run at all due to insufficient funds. This object has the following fields:need: the funds needed to start a job (BigDecimal)have: the funds at hand (BigDecimal)
NoEstimate, which indicates that an unexpected runtime error occurred.
The following is a ScalaDoc-generated signature:
def estimate[A <: AutomanAdapter](
confidence_interval: ConfidenceInterval = UnconstrainedCI(),
confidence: Double = MagicNumbers.DefaultConfidence,
budget: BigDecimal = MagicNumbers.DefaultBudget,
default_sample_size: Int = -1,
dont_reject: Boolean = true,
dry_run: Boolean = false,
estimator: (Seq[Double]) ⇒ Double = null,
image_alt_text: String = null,
image_url: String = null,
initial_worker_timeout_in_s: Int = ...,
max_value: Double = Double.MaxValue,
minimum_spawn_policy: MinimumSpawnPolicy = null,
min_value: Double = Double.MinValue,
mock_answers: Iterable[MockAnswer[Double]] = null,
pay_all_on_failure: Boolean = true,
question_timeout_multiplier: Double = ...,
text: String,
title: String = null,
wage: BigDecimal = MagicNumbers.USFederalMinimumWage)
(implicit a: A)
: EstimationOutcome Sampling Questions
We describe the checkboxes constructor here, but freetexts and radios are similar. There is also a buggy multiestimate constructor that should probably not be used at the moment.
The following constructor parameters are mandatory:
def checkboxes(
sample_size: Int = ...,
options: List[MTQuestionOption],
text: String)
: VectorOutcome[Set[Symbol]] sample_sizeis the size of the sample.optionsare the selection options seen by the user, along with optional images. Options can be created using one of the followingchoiceconstructors:choice(key: Symbol, text: String)orchoice(key: Symbol, text: String, image_url: String)
where
keydenotes a stable identifier for a choice (e.g.,kermit) not shown to the worker,textis the text label shown to the worker, andimage_urlis a url of an image shown beside the text label.textis the text of the question shown in aHITand, by default, also as the task title. You can override the title by setting thetitleparameter.
The following is a ScalaDoc-generated signature:
def checkboxes[A <: AutomanAdapter, O](
sample_size: Int = ...,
budget: BigDecimal = MagicNumbers.DefaultBudget,
dont_reject: Boolean = true,
dry_run: Boolean = false,
image_alt_text: String = null,
image_url: String = null,
initial_worker_timeout_in_s: Int = ...,
minimum_spawn_policy: MinimumSpawnPolicy = null,
mock_answers: Iterable[MockAnswer[Set[Symbol]]] = null,
options: List[AnyRef],
pay_all_on_failure: Boolean = true,
question_timeout_multiplier: Double = ...,
text: String,
title: String = null,
wage: BigDecimal = MagicNumbers.USFederalMinimumWage)
(implicit a: A): VectorOutcome[Set[Symbol]] Common Default Parameters
The following are parameters common to all calls:
budgetis the total amount of money to be spent by a given human question function call. Note that this means that each function call has its own budget. default: $5.00dont_reject, when set totrue, will always accept completed assignments and pay workers for their work. This is useful when work is difficult and errors are likely, or when you just don't want to deal with the hassle of reputation management. default:falsedry_run, when set totrue, will not actually post jobs on MTurk. default:falseimage_alt_textadds an HTMLALTannotation to theIMGtag created by theimage_urlparameter. default: none (null)image_urladds an image to a question. Such images should be hosted someplace publically-accessible, such as Amazon S3 or a personal website. default: none (null)initial_worker_timeout_in_sis the amount of time permitted to a worker in the initial round of tasks. Note that the actual time permitted depends on the number of rounds and is determined by the quality control policy. The default policy uses the formula , where is theinitial_worker_timeout_in_s, , and is the round. In other words, task timeout are doubled. default: 30 secondsminimum_spawn_policystates what the smallest number of assignments for a givenHITare on MTurk. This is necessary because MTurk has two totally boneheaded policies:HITs posted with 10 or fewer assignments are charged a 20% fee while HITs with more than 10 assignments are charged a 40% fee.
HITs with 10 or fewer assignments cannot be "extended" to have more assignments.
For now, what this means is that, if you do not change the default, AutoMan will post tasks with at least 10 assignments. If you anticipate that your tasks will likely need fewer than 10 assignments, you can set the anticipated amount by setting this to
UserDefinableSpawnPolicy(n)wherenis the number you want. default: 10 note: I am actively unhappy about this and am thinking of ways to simplify it. Suggestions welcome.mock_answerssets AutoMan to be used in mock mode for testing purposes. This is used interally by AutoMan for testing. You should not change this. default:nullpay_all_on_failurecontrols whether workers are paid when a task runs out of money. Setting this tofalsemeans that workers will not be paid when anOverBudgetresult is returned, which generally makes workers unhappy. default:truequestion_timeout_multipliercontrols how much time a HIT exists on MTurk before it is timed out. Note that this is a distinct timeout from the amount of time a worker is given to complete a task, which is controlled by theinitial_worker_timeout_in_sparameter. A HIT's total time is determined by the formula , where is theinitial_worker_timeout_in_s, , is the round, and is thequestion_timeout_multiplier.default: 500
wagecontrols the base wage for a worker. The actual reward paid depends on how much time a worker is given to do a task. The default policy uses a maximum likelihood estimate of the probability that a task is accepted in order to compute a wage that disincentivzes wage gaming behavior. It is complicated enough that if you want to know its inner workings, you should read our 2016 CACM article. Generally you should think of the reward as "probably doubling." default: the U.S. Federal Minimum wage, or $7.25/hourais an initialized AutoMan platform adapter. Typically this will be animplicitvariable that you return from a platform initializer expression likemturk. When markedimplicit, you do not need to pass the parameter yourself; Scala will find it in the environment and pass it, simplifying human function calls. AutoMan needs this information in order to bind a human function call to a given crowdsourcing platform.
Using AutoMan with a Different Crowdsourcing Backend
We currently only support Amazon's Mechanical Turk. However, AutoMan was designed to accommodate arbitrary backends. If you are interested in seeing your crowdsourcing platform supported, please contact us.
Memoization
AutoMan can be configured to save all intermediate human-computed results. Set the location of the database with database_path = "/path/to/your/database". The format of the database is H2.
Sample Applications
Sample applications can be found in the apps directory. Apps can also be built using pack. E.g.,
$ cd apps/simple_program
$ sbt packUnix/DOS shell scripts for running the programs can then be found in apps/[the app]/target/pack/bin/.
Last updated
Was this helpful?