Story Adaptation and Character Networks
- Copyright 2023-2024 Arthur Amalvy, Madeleine Janickyj, Shane Mannion, Pádraig MacCarron, and Vincent Labatut
Sachan is free software: you can redistribute it and/or modify it under the terms of the GNU General Public License as published by the Free Software Foundation. For source availability and license information, see licence.txt.
- Lab site: http://lia.univ-avignon.fr/
- GitHub repo: https://github.com/CompNet/Sachan
- Data:
- Contacts: Vincent Labatut [email protected] / Pádraig MacCarron [email protected]
If you use this source code or the associated dataset, please cite reference [A'24].
This set of R and Python scripts aims at analyzing character networks extracted from G. R. R. Martin's A Song of Ice and Fire novels and their adaptations into comics and the TV show Game of Thrones.
The scripts tackle two tasks described in [A'24]. The first task is character matching, which consists in identifying pairs of vertices representing the same character in two of these networks, based on the graph structure (and some additional information). The second task is narrative matching, and consists in identifying pairs of narrative units (chapters, scenes, episodes...) that represent the same chunk of story in two different media.
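To make the narrative matching task more concrete, here is a minimal Python sketch. It is not the method of [A'24]: it simply assumes that each narrative unit is summarized by the set of characters appearing in it, and pairs each unit of one medium with the unit of the other medium that has the highest Jaccard similarity. The function names and the toy data are hypothetical.

```python
def jaccard(a, b):
    """Jaccard similarity between two sets (defined as 0.0 when both are empty)."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def match_units(units_a, units_b):
    """For each unit of medium A, return the most similar unit of medium B.

    Both arguments map a unit name to the set of characters appearing in it.
    """
    matches = {}
    for name_a, chars_a in units_a.items():
        # Pick the unit of B whose character set overlaps the most.
        best = max(units_b, key=lambda name_b: jaccard(chars_a, units_b[name_b]))
        matches[name_a] = best
    return matches


# Hypothetical toy data: characters present in each narrative unit.
novel_chapters = {
    "chapter_1": {"Bran", "Eddard", "Robb", "Jon"},
    "chapter_2": {"Catelyn", "Eddard"},
    "chapter_3": {"Daenerys", "Viserys", "Illyrio"},
}
tv_scenes = {
    "scene_a": {"Daenerys", "Viserys", "Illyrio", "Drogo"},
    "scene_b": {"Bran", "Eddard", "Robb", "Jon", "Theon"},
    "scene_c": {"Catelyn", "Eddard", "Luwin"},
}

matches = match_units(novel_chapters, tv_scenes)
for chapter, scene in matches.items():
    print(chapter, "->", scene)
```

A greedy best-match like this ignores the order of the story and allows several units to map to the same target, so it is only a baseline for illustration.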
Some of the scripts also compute various descriptive statistics. Finally, the scripts include some processing to extract the networks from the original raw data and clean them. However, the clean networks themselves are also directly provided.
The networks representing all three media are available online on Zenodo. This collection includes various types of dynamic graphs (instant vs. cumulative), computed using various narrative units: chapters for novels, scenes and chapters for comics, and scenes, blocks and episodes for the TV show. The Zenodo repository also includes the many files produced by the scripts.
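The difference between instant and cumulative dynamic graphs can be sketched in Python as follows. This is an illustration under stated assumptions, not the project's extraction code: it assumes that an instant graph covers a single narrative unit, that a cumulative graph aggregates all units up to the current one, and that edges are weighted by character co-occurrence. The function names and the toy data are hypothetical.

```python
from collections import Counter
from itertools import combinations


def instant_networks(units):
    """One weighted edge list per narrative unit, built from that unit only."""
    networks = []
    for characters in units:
        edges = Counter()
        # Every pair of characters present in the unit co-occurs once.
        for pair in combinations(sorted(characters), 2):
            edges[pair] += 1
        networks.append(edges)
    return networks


def cumulative_networks(units):
    """One weighted edge list per narrative unit, summing all units so far."""
    networks = []
    running = Counter()
    for edges in instant_networks(units):
        # Counter addition returns a new object, so earlier snapshots stay intact.
        running = running + edges
        networks.append(running)
    return networks


# Hypothetical toy data: characters present in three consecutive units.
units = [{"Jon", "Sam"}, {"Jon", "Sam", "Aemon"}, {"Arya", "Gendry"}]

instant = instant_networks(units)
cumulative = cumulative_networks(units)
print(dict(instant[2]))
print(dict(cumulative[2]))
```

In this toy case, the last instant graph only contains the Arya/Gendry edge, whereas the last cumulative graph also keeps the edges of the first two units, with the Jon/Sam edge weighted by its two co-occurrences.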
Here are the folders composing the project:
- Folder `in`: data used by the scripts.
  - Folder `comics`: networks related to the comics.
  - Folder `novels`: networks related to the novels.
  - Folder `plot_alignement`: data used for narrative matching.
  - Folder `tvshow`: networks related to the TV show.
- Folder `out`: files produced by the scripts.
  - Folder `centrality`: centrality study.
  - Folder `descript`: descriptive analysis.
  - Folder `narrative_matching`: results of the narrative matching task.
  - Folder `vertex_matching`: results of the character matching task.
  - Folder `visualization`: plots of the networks.
- Folder `src`: source code.
  - Folder `common`: functions used in other scripts.
  - Folder `descript`: descriptive analysis.
  - Folder `narrative_matching`: narrative matching methods.
  - Folder `preprocessing`: extraction and cleaning of the networks.
  - Folder `vertex_matching`: character matching methods.
  - Folder `visualization`: graph plotting.
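After downloading the project, the folder layout listed above can be checked with a few lines of Python. This is only a convenience sketch: the helper name `missing_folders` is hypothetical, the folder names are copied from the list above, and `out` is left out on the assumption that it may only be filled once the scripts have run.

```python
from pathlib import Path

# Folder names copied from the project layout; `out` is deliberately omitted
# (assumption: it may not exist before the scripts are executed).
EXPECTED = {
    "in": ["comics", "novels", "plot_alignement", "tvshow"],
    "src": ["common", "descript", "narrative_matching", "preprocessing",
            "vertex_matching", "visualization"],
}


def missing_folders(root):
    """Return the expected sub-folders that are absent under `root`."""
    root = Path(root)
    missing = []
    for parent, children in EXPECTED.items():
        for child in children:
            path = root / parent / child
            if not path.is_dir():
                missing.append(path.relative_to(root).as_posix())
    return missing


if __name__ == "__main__":
    # Check the current directory; replace "." with the unzipped project path.
    for folder in missing_folders("."):
        print("missing:", folder)
```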
To execute the R scripts, you first need to install the language and the required packages:
- Install the R language.
- Download this project from GitHub and unzip it.
- Install the required packages:
  - Open the R console.
  - Set the unzipped directory as the working directory, using `setwd("<my directory>")`.
  - Run the install script `src/_install.R` (that may take a while).
For the Python scripts, ...TODO...
To run the R scripts:
- Open the R console.
- Set the project directory as the working directory, using `setwd("<my directory>")`.
- Run the main script `src/main.R`.
To run the Python scripts: ...TODO...
These scripts will produce a number of files in folder `out`.
Tested with R version 4.3.2, with the following packages:
- `cluster`: version 2.1.6.
- `fmsb`: version 0.7.6.
- `igraph`: version 1.6.0.
- `iGraphMatch`: version 2.0.3.
- `latex2exp`: version 0.9.6.
- `plot.matrix`: version 1.6.2.
- `scales`: version 1.3.0.
- `SDMTools`: version 1.1-221.2.
- `viridis`: version 0.6.4.
- `XML`: version 3.99-0.16.1.
Tested with Python version xxxx with the following packages:
- xxxxx: version x.x.x. ...TODO...
- ...
- [A'24]