Austen#
Downloaded from Project Gutenberg. Emma and Pride and Prejudice are free from copyright in the US and in most countries, according to Project Gutenberg. Please check laws in your country before use.
undefined
Initialization#
library(fosdata)
data <- fosdata::austenAccessing fields#
data <- fosdata::austen
word_length <- data$word_length # Just a random field in the datasetInteractive R Sample#
You can use the R editor below to interactively explore the dataset and generate plots. This contains a fully self-contained R environment with fosdata, ggplot2, and dplyr loaded.
Console
Plot
No plot generated yet.
LLM instructions#
If using an LLM, you can copy-paste the following instructions to accompany your prompt to inform the model of the fields and their types in the dataset.
LLM Instructions
The fosdata::austen dataset containing the following fields:
fields[7]{name,type,values}:
word,character,n/a
sentence,integer,n/a
chapter,integer,n/a
word_length,integer,n/a
stop_word,logical,[FALSE,TRUE]
sentiment_score,integer,n/a
novel,character,[Emma,Pride and Prejudice]Fields#
| Name | Description | Type | Min | Max | Values |
|---|---|---|---|---|---|
word |
A word in either Emma or Pride and Prejudice | character | - | - | - |
sentence |
The sentence number of the book that the word appears in. | integer | 1 | 9340 | - |
chapter |
The chapter of the book that the word appears in. | integer | 1 | 61 | - |
word_length |
The length of the word. | integer | 1 | 19 | - |
stop_word |
Is the word a stop word? Stop word are words such as “the” “and” or “of,” which are common and don’t carry sentiment. | logical | - | - | FALSE, TRUE |
sentiment_score |
Sentiment score of the word. Larger numbers correspond to more positive sentiment. | integer | -4 | 4 | - |
novel |
One of Emma or Pride and Prejudice. | character | - | - | Emma, Pride and Prejudice |
Source#
https://www.gutenberg.org/files/158/158-0.txt and https://www.gutenberg.org/files/1342/1342-0.txt