Project IV : 

 Google and Financial Crises 
 (Part 1) 




 

TL;DR. Can Google's PageRank measure be used to highlight and stop global financial contagions? Not in its raw form. In fact, in its raw form it could make things worse! But not all hope is lost...

 

 

 

Google can do no wrong in my eyes. Apart from that Google+ integration into YouTube --- that was horrendous. Anyway, from self--driving cars to wifi in weather balloons, and from contact lenses that measure glucose levels and diabeties to being every students' best friend when it comes to all--nighter essay writing, Google has been at the top of its game. In 16 short years (almost to the day) it really has become the backbone of our civilisation. Can it do even more than everyone thought it could do? Could it have the key to saving our economic world?

The overarching question I ask here is: "Can Google's PageRank measure be used to stop global financial contagions?" By "Google PageRank " I don't mean Google search results or its Zeitgeist (although, using those would be interesting too). Instead I mean the actual algorithm Google uses to sort and display search results in terms of each results' relevance to the search query.

The overarching question is pretty odd. What does a search algorithm have to do with financial folly through something like subprime mortgage provision? Nothing. So, how could Google's PageRank be used to stop financial crises spreading from one bank to the other, from one business to the other, and from one economy to the other? Well, I think PageRank might be a stepping--stone to being able to do this. However, in order to answer this we need to understand what exactly PageRank is, what it does, and where it can be applied. Without spoiling The answer has something to do with networks... obviously!




Why?

Before we even try to answer the overarching question we should probably ask "why bother looking at this?". Since the topics are so unrelated it seems like a pretty stupid question...

In 2008, after the collapse of the investment bank Lehman Brothers, Governments' across the world participated in bailling out their largest financial institutions. This immediately led to billions of dollars (pounds Sterling, euro whatever currency) worth of tax--payers money being transferred to initiatives, such as the Troubled Asset Relief Program (TARP), to purchase bad debt from institutions that poorly lent to borrowers. The result was a more consolidated financial sector, cartel formation and LIBOR fixing, a lengthy bout of recesson, and a schizophrenic recovery. It has been argued by many whether the actions conducted by Governments were in the best interest of society, or whether their actions and policies favoured the interests of large financial institutions. Moreover, it has been argued that the use and distribution of taxpayers money was inefficient.

If we can develop a measure to sort financial institutions in terms of their impact to the stablity of the network as a whole then perhaps we could make the bailing out process more efficient and stop the problems with the financial system before they escalate. Figuring out whether PageRank can be used to measure the relative importance of financial institutions could be extremely important!



Understanding PageRank

In network terms it is a measure of "centrality". The centrality of a node in a network is a measure that indicates the importance of a node in terms of some characteristic, which is typically associated with the nodes' degree (how many arcs or links it has in the network). There are a host of different centrality measures that make use of different characteristics. The most famous is PageRank developed by Sergey Brin and Larry Page (founders of Google Inc.), even though it is basically just the same as many others including the beta--measure developed by Gilles and Van den Brink who completely axiomatised the measure.

In its simplest form, the Wolrd Wide Web (WWW) can be projected as a directed network: webpages are represented as nodes and there is an arc from one node, i, to another, j, if there exists a hyperlink from page i to page j. When the WWW is projected in this way centrality measures are used by search engines to rank websites and provide more accurate search results, irrespective of the contextual nature of the search string.

Moreover, the structure of the WWW shares the same properties as many other networks, such as social networks, neural networks, e--mail and messaging conversations, protein, metabolic, and cancer cells, and economic / financial networks, which is very useful for this article! Indeed, it seems that the stucture (or topology) of all complex systems evolve a number of distinct properties:


(1)   The average distance (measured by shortest path length) between pairs of nodes in a social network tends to be small, and the maximum distance between any two pairs of nodes (called the diameter) is also small;

(2)   Clustering is higher than purely random, meaning that all networks have patches of dense connections;

(3)   Distribution of degrees of the nodes in a network exhibits fat tails, so there are more nodes with relatively high and low degrees and fewer nodes with medium degrees;

(4)   The degrees of linked nodes tend to be positively correlated, so that higher degree nodes are more likely to be linked to other higher degree nodes, and lower degree nodes are more likely to be linked to other lower degree nodes;

(5)   The clustering among the neighbors of a given node is inversely related to the node’s degree.


Due to these common properties we will find that there exists a few nodes that are more connected than others; that it does not take long to get from one node to any other node in the network; and that there will exists pockets of highly clustered nodes, i.e., cliques. These properties suggest that there must exist some nodes that are more important than others in terms of their connectivity and coverage in the network and also that any contagious process beginning from a single node (or set of nodes) can affect the entire network extremely quickly.

The first common centrality measure used, and the one that PageRank really built upon, was the Bonacich centrality measure which translates the network as an adjacency matrix, then takes the largest Eigenvalue of the matrix and assigns relative scores to all nodes based on the size of each nodes Eigenvalue. The outcome of this measure means that nodes that have important nodes linked to them will also be important themselves, and their importance increases with the number of nodes that link to the node.

Formally, the Bonancich centrality of some node i, denoted by  yi, can be expressed as


  bonacich  



Where the adjacency matrix of the network is denoted by G and the largest eigenvalue of the matrix is denoted by kappa. From the formula it's pretty easy to see that the Bonacich centrality of some node i is dependent on the other nodes j that connect, or 'point', to it.


black Example. Imagine that we were mapping the Twitter network of 'follower--following' relationships. We assume that all poeple and organisations are represented as a set of nodes and there exists an arc (a pointed arrow) from one node (Owen) to another (Tom) if Owen follows Tom. Indeed, if Owen follows Tom then Owen points to Tom.
Let's first assume that Owen follows (points to) Tom where Owen has very few followers and his followers also have very few followers; then Owen following Tom will not increase Tom's Bonacich centrality by very much since Owen is not important (does not have a high centrality) himself. Conversely, if Owen has a lot of followers and they also have a lot of followers then Owen following Tom will increase Toms centrality by a significant amount.


black




Bonacich centrality therefore prioritises nodes that have many nodes pointing to them, which also have many nodes pointing to them, and so on. Therefore, each node is ranked depending on how many ways each node can be reached --- the more the better.

An evolution of the Bonacich centrality measure is the Katz centrality which measures all nodes that can be connected through a path, and contributions to distant nodes are penalised.


  bonacich  



Where alpha is some positive parameter (above 0) that con be freely chosen. The PageRank measure then is an extension of both the Bonacich and Katz centrality measures. It simply applies a scaling factor to the Katz centrality. Specifically, the scaling factor that is derived from each nodes' neighbours is proportional to their centrality divided by the number of nodes pointing to them. Then nodes that point to many others pass only a small amount of centrality on to each of those others, even if their own centrality is high.

PageRank is a calculation applied to nodes (or webpages) in a network that measures a nodes' importance based on the relative importance of all other nodes that point (connect) to it, which is then scaled down by a factor depending on how many other nodes each node points to. In mathematical terms, the PageRank of some node  , denoted by  , is defined by:


  bonacich  



Where    refers to the set of neighbours of i in the network    and we define    as   .

It is directly based on the Bonacich and Katz measures (although Brin and Page do not even acknowledge these measures in their papers). It's not the most novel measure of centrality, but regardless, from the success of Google we can conclude that this is a pretty effective measure for ranking the importance of webpages.

So, what has this got to do with financial crises?




The connected nature of finance

We all know what banks are. They provide one of the most vital roles in society. We deposit our savings into them and they use these deposits to invest in entrepreneurial activities, business projects, mortgages, and other types of investment. The return on these investments leads to bank profit and provides interest on depositors money. They are the intermediaries between people that have excess money and people that want money. Simplifying somewhat, if the value of a banks assets (investments) are greater than its liabilities (the claims on it by others) then the bank becomes insolvent and extremely vulnerable to bank runs.

In order to honour all of its claims by depositors and continue lending to clients banks lend to each other. Sometimes banks have a level of claims on them that their current working capital cannot maintain, meaning that they have to borrow money from other institutions for varying periods of time and at a rate of interest determined by the prevailing LIBOR (London Inter-Bank Offered Rate), which is just the rate of interest at which banks lend between themselves.

Banks are therefore connected in a network in two ways. First, if a set of banks are invested in a given project or a given industry then they are connected indirectly through some form of overlapping portfolio. Second, one bank is directly connected to another if the bank lends money to the other. Specifically we could project these relationships as a directed network, much like we did with the WWW above.

If some bank, i, lends to another, j, then bank i points to bank j. The arcs in this network denote the flow of money from one bank (node) to another bank (node). Specifically, if there exists an arc from i to j then j is in debt to i.

Let's assume that some bank borrows a lot of money from another under the belief that the borrowing bank will repay the other the principal plus some interest at the rate of LIBOR. If the borrowing bank cannot repay the loan and defaults then the lending bank may become insolvent and unable to repay its liabilities to depositors and other financial institutions it borrowed from, etc. The default from one institution can cascade onto others when they are linked in a network: the failure of one bank or one investment can roll over onto others both directly and indirectly connected to it.

The transmission of financial crises therefore moves in the opposite direction of the arcs meaning that the node / financial intermediary with the largest impact on the stability of the network should be the node with the largest number of other nodes pointing to it. This is exactly how webpages are ranked by Google's PageRank: the more points the better.

A network can be a natural way to express both the WWW and the financial system. Further still, both can be represented as a directed network where one uses hyperlinks and the other uses loans to characterise the arcs. So can we use the same technique to highlight important financial institutions and banks as PageRank does to highlight important webpages?

From the description above we should find that the financial institutions which have more nodes pointing to it, i.e. more banks borrowing from it, should be those that are more crucial to the stability to the financial system as a whole. If a financial institution with a large number of other financial institutions lending to it were to fail then it would trigger a larger cascade of failure than the failure of a financial institution that borrows from only a single other node.

In the most superficial way PageRank seems like a great measure for the importance of financial intermediaries: it gives a higher importance to nodes that have a larger number of other node pointing to it.




The problems of PageRank

Some commentators have concluded that PageRank, or the newly known DebtRank, measure is the panacea to all future financial contagions... However, this might not exactly be the case. Although both the financial system and the WWW can be projected as networks, the context of these networks is different, even in the most general sense. I highligh two main complications here.

(1)   Nodes in a financial network have values attached to them that represent their assets and liabilities. A contagious process will only continue cascading in a financial network when the value of a nodes' liabilities exceeds the value of the nodes assets. Indeed, contagion is not necessarily automatic. And it could be that a bank that diversifies its borrowing, and therefore has a large number of other banks pointing towards to, could in actuality have less of an impact rather than more of an impact as PageRank would suggest!

(2)   PageRank was never created to measure dynamics on networks and assumes that the network remains static through time. However, as nodes fail they are effectively removed from the network and the PageRank of the nodes in the reduced network may become different. Moreover, with PageRank we cannot attach probabilities of failure to nodes and use a reduced network based on these probabilities.

Due to these problems, using PageRank to rank the importance of financial institutions and bailing out banks depending on their PageRank score could actually be detrimental to saving the financial system.

Despite PageRank being imperfect for assessing which financial institutions are most critical for the functioning of the financil system all is not lost. PageRank actually seems to be a pretty good basis in which to begin assessing the impact of banks when they suffer defaults and failure.




More in Part 2...

In Part 2 I will expand on the PageRank measure and provide an alteration that can be used for efficient vaccination of distressed financial institutions. This is an elaboration of PageRank but applied directly to financial institutions.

Maybe Google can save the world after all... with a little help!