Skip to content

CompNet/Sachan

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Sachan

Story Adaptation and Character Networks

  • Copyright 2023-2024 Arthur Amalvy, Madeleine Janickyj, Shane Mannion, Pádraig MacCarron, and Vincent Labatut

Sachan is free software: you can redistribute it and/or modify it under the terms of the GNU General Public License as published by the Free Software Foundation. For source availability and license information see licence.txt


If you use this source code or the associated dataset, please cite reference [A'24].

Description

This set of R and Python scripts aims at analyzing character networks extracted from G. R. R. Martin's A Song of Ice and Fire novels, and its adaptations into comics and the TV Show Game of Thrones.

The scripts tackle two tasks described in [A'24]. The first task is character matching, which consists in identifying pairs of vertices representing the same character in two of these networks, based on the graph structure (and some additional information). The second task is narrative matching, and consists in identifying pairs of narrative units (chapters, scenes, episodes...) that represent the same chunk of story in two different media.

Some of the scripts also allow to compute various descriptive statistics. Finally, the scripts also include some processing aiming at extracting the networks from the original raw data, and performing some cleaning. However, the clean networks themselves are also directly provided.

Data

The networks representing all three media are available online on Zenodo. This collection includes various types of dynamic graphs (instant vs. cumulative), computed using various narrative units: chapters for novels, scenes and chapters for comics, scene, blocks and episodes for the TV show. The Zenodo repository also includes the many files produced by the scripts.

StaticNet

Organization

Here are the folders composing the project:

  • Folder in: data used by the scripts.
    • Folder comics: networks related to the comics.
    • Folder novels: networks related to the novels.
    • Folder plot_alignement: data used for narrative matching.
    • Folder tvshow: networks related to the TV show.
  • Folder out: files produced by the scripts
    • Folder centrality: centrality study.
    • Folder descript: descriptive analysis.
    • Folder narrative_matching: results of the narrative matching task.
    • Folder vertex_matching: results of the character matching task.
    • Folder visualization: plots of the networks.
  • Folder src: source code.
    • Folder common: functions used in other scripts.
    • Folder descript: descriptive analysis.
    • Folder narrative_matching: narrative matching methods.
    • Folder preprocessing: extraction and cleaning of the networks.
    • Folder vertex_matching: character matching methods.
    • Folder visualization: graph plotting.

Installation

To execute the R scripts, you first need to install the language and the required packages:

  1. Install the R language
  2. Download this project from GitHub and unzip.
  3. Install the required packages:
    1. Open the R console.
    2. Set the unzipped directory as the working directory, using setwd("<my directory>").
    3. Run the install script src/_install.R (that may take a while).

For the Python scripts, ...TODO...

Use

In order to apply the R scripts:

  1. Open the R console.
  2. Set the current directory as the working directory, using setwd("<my directory>").
  3. Run the main script src/main.R.

In order to apply the Python scripts:

...TODO...

These scripts will produce a number of files in folder out.

Dependencies

Tested with R version 4.3.2, with the following packages:

Tested with Python version xxxx with the following packages:

  • xxxxx: version x.x.x. ...TODO...

To-do List

  • ...

References

  • [A'24]