13 Resterampe
13.0.1 Character meta data
We will now combine this information with additional meta data about characters, i.e., gender.
# Proportion of male / female characters
barplot(table(text$characters$Gender),col=qd.colors)
13.0.2 Character groups
Next, we want to make the same analysis not for individual characters, but for character groups, based on categories such as gender.
ustat <- utteranceStatistics(rjmw.0, 
                             normalizeByDramaLength = FALSE # use absolute values
                            )
characterdata <- rjmw.0$characters
ustat <- merge(ustat, characterdata, 
               by.x = c("corpus","drama", "character"),
               by.y = c("corpus","drama", "figure_id"))
par(mar=c(2,2,2,2))
boxplot(utteranceLength ~ Gender, # what do we want to correlate 
        data=ustat, 
        las = 1 # rotate axis labels
      )
According to this picture, female characters speak slightly longer utterances in this play.