About Dataset
A harmless fake NLP release for testing text previews, label descriptions, and download affordances in pre-release mocks.
Elevator Small Talk Corpus
Tiny synthetic text snippets labeled by how quickly everyone starts staring at the floor numbers.
kaggle datasets download -d awkward-nlp/elevator-small-talk-corpus
python - <<'PY'
import pandas as pd
rows = pd.read_csv("corpus/train.jsonl")
print(rows.describe(include="all"))
PY
A harmless fake NLP release for testing text previews, label descriptions, and download affordances in pre-release mocks.
Preview file for train.jsonl Preview-sample stats (3 rows).
|
Autterance_id
Link to Organization
|
123text
Text
|
123label
Label
|
Asplit
Split
|
|---|---|---|---|
33 unique · 0% null
| 33 unique · 0% null
| 33 unique · 0% null
| 22 unique · 0% null
|
| talk-001 | Weather is certainly happening today. | weather-neutral | train |
| talk-002 | This elevator has excellent vertical commitment. | situational | train |
| talk-003 | Mondays arrive with confidence. | calendar | validation |