Austen#

Downloaded from Project Gutenberg. Emma and Pride and Prejudice are free from copyright in the US and in most countries, according to Project Gutenberg. Please check laws in your country before use.

undefined

Initialization#

library(fosdata)
data <- fosdata::austen

Accessing fields#

data <- fosdata::austen
word_length <- data$word_length # Just a random field in the dataset

Interactive R Sample#

You can use the R editor below to interactively explore the dataset and generate plots. This contains a fully self-contained R environment with fosdata, ggplot2, and dplyr loaded.

webR + fosdata Test

Console
Plot

    
No plot generated yet.
scatterplot

LLM instructions#

If using an LLM, you can copy-paste the following instructions to accompany your prompt to inform the model of the fields and their types in the dataset.

LLM Instructions
The fosdata::austen dataset containing the following fields:

fields[7]{name,type,values}:
  word,character,n/a
  sentence,integer,n/a
  chapter,integer,n/a
  word_length,integer,n/a
  stop_word,logical,[FALSE,TRUE]
  sentiment_score,integer,n/a
  novel,character,[Emma,Pride and Prejudice]

Fields#

Name Description Type Min Max Values
word A word in either Emma or Pride and Prejudice character - - -
sentence The sentence number of the book that the word appears in. integer 1 9340 -
chapter The chapter of the book that the word appears in. integer 1 61 -
word_length The length of the word. integer 1 19 -
stop_word Is the word a stop word? Stop word are words such as “the” “and” or “of,” which are common and don’t carry sentiment. logical - - FALSE, TRUE
sentiment_score Sentiment score of the word. Larger numbers correspond to more positive sentiment. integer -4 4 -
novel One of Emma or Pride and Prejudice. character - - Emma, Pride and Prejudice

Source#

https://www.gutenberg.org/files/158/158-0.txt and https://www.gutenberg.org/files/1342/1342-0.txt