API Analysis Result Interpretation

Analysis Result Interpretation Guide

The UPSTREAM and ANALYSIS requests returns a JSON object which contains analysis result.
Following table summarizes fields, values and their descriptions of the returned JSON object.

JSON Object Field Description Version Support Notes
{
“status”:”success”, The status of request. Can be “success” or “error”.
“result”:{ The object of analysis results.
  “duration”:”11155.13″, Duration of voice data processed in milliseconds
  “sessionStatus”:”Processing”, Session status can be:
“Started” – no analysis data yet produced,
“Processing” – intermediate results , more analysis can be expected,
“Done” – analysis session has ended, the result has an analysis results for whole session.
  “analysisSegments”:[ The array containing analysis segments
  { First analysis segment object. Following fields are  properties of the segment
     “offset”:0, Offset of the segment in milliseconds from the beginning of the session.
     “duration”:10000, Segment duration in milliseconds
     “end”:10000, The end of the segment in milliseconds V4 and above
     “analysis”:{ Analysis object. Contains analysis values for the segment. The content of the object is provided as example. The real fields can vary depending on license type
         “Temper”:{ Temper Object
           “Value”:”21.00″, Value of Temper
           “Group”:”low”, Group of Temper ambiguous value: V4 and above"unanalyzable"
value: V4 and above
           “Score”:”92.00″, Confidence score of Temper (92 % positive) V4 and above
},
        “Valence”:{ Valence Object. (similar to Temper object)
           “Value”:”23.00″,            Value of Valence
           “Group”:”negative”, Group of Valence ambiguous value: V4 and above"unanalyzable"
value: V4 and above
           “Score”:”94.00″, Confidence score of Valence (94 % positive) V4 and above
},
        “Arousal”:{ Arousal Object
           “Value”:”24.00″, Value of Arousal
           “Group”:”low”, Group of Arousal ambiguou value: V4 and above"unanalyzable"
value: V4 and above
           “Score”:”80.00″, Confidence score of Arousal (80% fit to the result Arousal) V4 and above
         },
        "Emotion_group":{ Emotion group object
" Group ":"Sadness/Uncertainty/Boredom" Category of Emotion group ***"inexplicit emotion" value: V5 and above
"Confidence_Score": "91.00" Confidence score of Emotion Group (91% fit to the result Emotion Group) V5 and above
"Vad":{ Voice activity detection
"Voiced":"62.00" Represents percent ratio of voice activity in the analyzed time window
           }
           }
           }
           {, Following analysis segment objects
….. ..
           },
           ],
"analysisSummary": { The object of analysis summary
"AnalysisResult":{ The object of analysis summary results
"Temper": { Temper Summary Object
"Mean": "8.00", Mean of Temper Values
"Mode": "low", Most frequent Temper Group
  “ModePct”:”100.00″ The Percentage of the most frequent Temper group of previously analyzed segments
  “Valence”:{ Valence Summary Object
"Mean": "2.80", Mean of Valence Values
  “Mode”:”negative” Most frequent Valence Group
  “ModePct”:”100.00″ The Percentage of the most frequent Valence group of previously analyzed segments
  },
  “Arousal”:{ Arousal Summary Object
"Mean": "9.00", Mean of Arousal Values
  “Mode”:”low” Most frequent Arousal Group
  “ModePct”:”100.00″ The Percentage of the most frequent Arousal group of previously analyzed segments
  },
"Emotion_group": { Emotion Group Summary Object V5 and above
"Mode": " Sadness/Uncertainty/Boredom", Most frequent Emotion Group V5 and above
"ModePct": "100.00" The Percentage of the most frequent Emotiongroup of previously analyzed segments V5 and above
  }
  }
  }
  }
  }

*Ambiguous value – is displayed when the confidence score is too low

**Unanalyzable value – is displayed when the segment cannot be explained or interpreted through
methodical examination. Possible reasons:

  • Segment that is composed of mostly silence, when the Voice Activity Detection (VAD) parameter is less than 15%.
  • Bad quality of recording – presence of very high background noise in the segment.
  • Speech/voice in the segment does not sufficiently adhere to the acoustic model, and as a result is not analyzed.

***Inexplicit value – is displayed when Temper or Valence have "ambiguous" value.