Austen#

Downloaded from Project Gutenberg. Emma and Pride and Prejudice are free from copyright in the US and in most countries, according to Project Gutenberg. Please check laws in your country before use.

undefined

Initialization#

library(fosdata)
data <- fosdata::austen

Accessing fields#

data <- fosdata::austen
word_length <- data$word_length # Just a random field in the dataset

Interactive R Sample#

You can use the R editor below to interactively explore the dataset and generate plots. This contains a fully self-contained R environment with fosdata, ggplot2, and dplyr loaded.

webR + fosdata Test

Console

Plot

No plot generated yet.

LLM instructions#

If using an LLM, you can copy-paste the following instructions to accompany your prompt to inform the model of the fields and their types in the dataset.

LLM Instructions

The fosdata::austen dataset containing the following fields:

fields[7]{name,type,values}:
  word,character,n/a
  sentence,integer,n/a
  chapter,integer,n/a
  word_length,integer,n/a
  stop_word,logical,[FALSE,TRUE]
  sentiment_score,integer,n/a
  novel,character,[Emma,Pride and Prejudice]

Fields#

Name	Description	Type	Min	Max	Values
`word`	A word in either Emma or Pride and Prejudice	character	-	-	-
`sentence`	The sentence number of the book that the word appears in.	integer	1	9340	-
`chapter`	The chapter of the book that the word appears in.	integer	1	61	-
`word_length`	The length of the word.	integer	1	19	-
`stop_word`	Is the word a stop word? Stop word are words such as “the” “and” or “of,” which are common and don’t carry sentiment.	logical	-	-	`FALSE`, `TRUE`
`sentiment_score`	Sentiment score of the word. Larger numbers correspond to more positive sentiment.	integer	-4	4	-
`novel`	One of Emma or Pride and Prejudice.	character	-	-	`Emma`, `Pride and Prejudice`

Source#

https://www.gutenberg.org/files/158/158-0.txt and https://www.gutenberg.org/files/1342/1342-0.txt