We’ve taken our database of over 1.3 million laws and extracted every two-word pair from every law, organized by country. We then eliminated common words (e.g. “the”, “this”) and words that aren’t of much interest (e.g. “subparagraph”). By merging these word pairs and their counts into one file per country, we’ve created lists of the most common word pairs by country. You can download the 48 raw text files here: https://www.global-regulation.com/assets/visualization/48_country_word_pairs_may_2016.zip.
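To make the extraction step concrete, here is a minimal Python sketch of counting two-word pairs. It is an illustration only, not the code used to build the files above: the `STOP_WORDS` set, the `word_pairs` and `count_pairs` names, and the choice to drop any pair that contains a stop word are all assumptions.

```python
import re
from collections import Counter

# Hypothetical stop list: common words plus uninteresting legal terms.
# The actual list used for the published files is not given in this post.
STOP_WORDS = {"the", "this", "of", "and", "a", "to", "in", "subparagraph"}

def word_pairs(text, stop_words=STOP_WORDS):
    """Return adjacent two-word pairs, skipping pairs that contain a stop word."""
    words = re.findall(r"[a-z]+", text.lower())
    return [
        (first, second)
        for first, second in zip(words, words[1:])
        if first not in stop_words and second not in stop_words
    ]

def count_pairs(laws, stop_words=STOP_WORDS):
    """Count word pairs across an iterable of law texts for one country."""
    counts = Counter()
    for law in laws:
        counts.update(" ".join(pair) for pair in word_pairs(law, stop_words))
    return counts
```

Calling `count_pairs(texts).most_common(20)` on one country's laws would give a ranked list of that country's 20 most frequent pairs.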
By using the word pairs as nodes and their connections to countries as edges, the text files can be merged to create a “graph” (a set of nodes and edges). The graphs were then written to text files in GDF format. The file for Canada can be found here: https://www.global-regulation.com/assets/visualization/Canada-word_pairs.gdf.
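As a rough sketch of how per-country counts could be turned into a graph file, the Python below writes a `nodedef>` section followed by an `edgedef>` section, which is the general shape of a GDF file. The layout is an assumption: it treats each country and each word pair as a node, links them with an edge weighted by the pair's count, and uses a hypothetical `kind` column and `to_gdf` function name. The published Canada file may be organized differently.

```python
def to_gdf(country_counts):
    """Build GDF text from {country: {word_pair: count}}.

    Assumes names contain no commas, since GDF rows are comma-separated.
    """
    nodes = []
    edges = []
    seen_pairs = set()
    for country, counts in country_counts.items():
        nodes.append(f"{country},country")
        for pair, count in counts.items():
            # A word pair shared by several countries is declared only once.
            if pair not in seen_pairs:
                seen_pairs.add(pair)
                nodes.append(f"{pair},word_pair")
            edges.append(f"{country},{pair},{count}")
    lines = [
        "nodedef>name VARCHAR,kind VARCHAR",
        *nodes,
        "edgedef>node1 VARCHAR,node2 VARCHAR,weight DOUBLE",
        *edges,
    ]
    return "\n".join(lines) + "\n"
```

Writing the returned string to a `.gdf` file gives something a graph tool that reads GDF can load, with edge weight available for sizing or filtering.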