Dependency graph

In mathematics, computer science and digital electronics, a dependency graph is a directed graph representing dependencies of several objects towards each other. It is possible to derive an evaluation order or the absence of an evaluation order that respects the given dependencies from the dependency graph.

Definition

A depends on B and C; B depends on D

Given a set of objects and a transitive relation with modeling a dependency "a depends on b" ("a needs b evaluated first"), the dependency graph is a graph with the transitive reduction of R.

For example, assume a simple calculator. This calculator supports assignment of constant values to variables and assigning the sum of exactly two variables to a third variable. Given several equations like "A = B+C; B = 5+D; C=4; D=2;", then and . You can derive this relation directly: A depends on B and C, because you can add two variables if and only if you know the values of both variables. Thus, B must be calculated before A can be calculated. Since B depends on D to be calculated, A must also depend on D to be calculated before it (hence the transitive property stated above). On the other hand, the values of C and D are known immediately, because they are number literals.

Recognizing impossible evaluations

In a dependency graph, cycles of dependencies (also called circular dependencies) lead to a situation in which no valid evaluation order exists, because none of the objects in the cycle may be evaluated first. If a dependency graph does not have any circular dependencies, it forms a directed acyclic graph, and an evaluation order may be found by topological sorting. Most topological sorting algorithms are also capable of detecting cycles in their inputs; however, it may be desirable to perform cycle detection separately from topological sorting in order to provide appropriate handling for the detected cycles.

Assume the simple calculator from before. The equation system "A=B; B=D+C; C=D+A; D=12;" contains a circular dependency formed by A, B and C, as B must be evaluated before A, C must be evaluated before B, and A must be evaluated before C.

Deriving an evaluation order

A correct evaluation order is a numbering of the objects that form the nodes of the dependency graph so that the following equation holds: with . This means, if the numbering orders two elements and so that will be evaluated before , then must not depend on .

There can be more than one correct evaluation order. In fact, a correct numbering is a topological order, and any topological order is a correct numbering. Thus, any algorithm that derives a correct topological order derives a correct evaluation order.

Assume the simple calculator from above once more. Given the equation system "A = B+C; B = 5+D; C=4; D=2;", a correct evaluation order would be (D, C, B, A). However, (C, D, B, A) is a correct evaluation order as well.

Monoid structure

An acyclic dependency graph corresponds to a trace of a trace monoid as follows:[1]: 12 

  • A function labels each vertex with a symbol from the alphabet
  • There is an edge or if and only if is in the dependency relation .
  • Two graphs are considered to be equal if their labels and edges correspond.

Then the string consisting of the vertex labels ordered by a correct evaluation order corresponds to a string of a trace.

The monoidal operation takes the disjoint union of two graphs' vertex sets, preserves the existing edges in each graph, and draws new edges from the first to the second where the dependency relation allows,[1]: 14 

The identity is the empty graph.

Examples

Dependency graphs are used in:

  • Automated software installers: They walk the graph looking for software packages that are required but not yet installed. The dependency is given by the coupling of the packages.
  • Software build scripts such as Unix Make, Node npm install, php composer, Twitter bower install, or Apache Ant. They need to know what files have changed so only the correct files need to be recompiled.
  • In compiler technology and formal language implementation:
    • Instruction scheduling: Dependency graphs are computed for the operands of assembly or intermediate instructions and used to determine an optimal order for the instructions.
    • Dead code elimination: If no side effected operation depends on a variable, this variable is considered dead and can be removed.
  • Dynamic graph analytics: GraphBolt[2] and KickStarter[3] capture value dependencies for incremental computing when graph structure changes.
  • Spreadsheet calculators. They need to derive a correct calculation order similar to that one in the example used in this article.
  • Web Forms standards such as XForms to know what visual elements to update if data in the model changes.
  • Video games, especially puzzle and adventure video games, which are frequently designed as a graph of dependent relationships between in-game actions.[4]

Dependency graphs are one aspect of:

See also

References

  1. ^ a b Mazurkiewicz, Antoni (1995). "Introduction to Trace Theory" (PDF). In Rozenberg, G.; Diekert, V. (eds.). The Book of Traces. Singapore: World Scientific. ISBN 981-02-2058-8. Retrieved 18 April 2021.
  2. ^ Mugilan Mariappan; Keval Vora (2019). "GraphBolt: Dependency-Driven Synchronous Processing of Streaming Graphs". In European Conference on Computer Systems (EuroSys'19). pp. 25:1–25:16. doi:10.1145/3302424.3303974.
  3. ^ Keval Vora; Rajiv Gupta; Guoqing Xu (2017). "KickStarter: Fast and Accurate Computations on Streaming Graphs via Trimmed Approximations". In International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS'17). pp. 237–251. doi:10.1145/3093337.3037748.
  4. ^ Gilbert, Ron. "Puzzle Dependency Charts". Grumpy Gamer. Retrieved 11 January 2020.

Further reading