Matroid partitioning

Matroid partitioning is a problem arising in the mathematical study of matroids and in the design and analysis of algorithms. Its goal is to partition the elements of a matroid into as few independent sets as possible. An example is the problem of computing the arboricity of an undirected graph, the minimum number of forests needed to cover all of its edges. Matroid partitioning may be solved in polynomial time, given an independence oracle for the matroid. It may be generalized to show that a matroid sum is itself a matroid, to provide an algorithm for computing ranks and independent sets in matroid sums, and to compute the largest common independent set in the intersection of two given matroids.[1]

Example

A partition of the edges of the complete bipartite graph K4,4 into three forests, showing that it has arboricity at most three

The arboricity of an undirected graph is the minimum number of forests into which its edges can be partitioned, or equivalently (by adding overlapping edges to each forest as necessary) the minimum number of spanning forests whose union is the whole graph. A formula proved by Crispin Nash-Williams characterizes the arboricity exactly: it is the maximum, over all subgraphs of the given graph , of the quantity .[2]

The forests of a graph form the independent sets of the associated graphic matroid, and the quantity appearing in Nash-Williams' formula is the rank of the graphic matroid of , the maximum size of one of its independent sets. Thus, the problem of determining the arboricity of a graph is exactly the matroid partitioning problem for the graphic matroid. The fact that the elements of this matroid cannot be partitioned into fewer than independent subsets is then just an application of the pigeonhole principle saying that, if items are partitioned into sets of size at most , then at least sets are needed. The harder direction of Nash-Williams' formula, which can be generalized to all matroids, is the proof that a partition of this size always exists.[1]

Formula for partition size

To generalize Nash-Williams' formula, one may replace by a matroid , and the subgraph of with a restriction of to a subset of its elements. The number of edges of the subgraph becomes, in this generalization, the cardinality of the selected subset, and the formula for the maximum size of a forest in becomes the rank . Thus, the minimum number of independent sets in a partition of the given matroid should be given by the formula

.

This formula is indeed valid, and it was given an algorithmic proof by Edmonds (1965).[1][3] In other words, a matroid can be partitioned into at most independent subsets, if-and-only-if for every subset of , the cardinality of is at most .

Algorithms

The first algorithm for matroid partitioning was given by Edmonds (1965).[3] It is an incremental augmenting-path algorithm that considers the elements of the matroid one by one, in an arbitrary order, maintaining at each step of the algorithm an optimal partition for the elements that have been considered so far. At each step, when considering an element that has not yet been placed into a partition, the algorithm constructs a directed graph that has as its nodes the elements that have already been partitioned, the new element , and a special element for each of the independent sets in the current partition. It then forms a directed graph on this node set, with a directed arc for each matroid element that can be added to partition set without causing it to become dependent, and with a directed arc for each pair of matroid elements such that removing from its partition and replacing it with forms another independent set.[1][3]

Now there are two cases:

  • If this graph contains a directed path from an element to the newly considered element , then the shortest such path (or more generally any path that does not have any shortcutting edges) describes a sequence of changes that can be made simultaneously to the partition sets in order to form a new partition, with the same number of sets, that also includes . In this case, the algorithm performs these changes and continues.
  • If, on the other hand, no such path exists, then let consist of the matroid elements from which is reachable in . Each set in the current partition must be a maximal independent set in the restriction , for if some element of could be added to partition set in the restriction, then either there would exist an arc (if partition set is non-maximal in the full matroid ) or an arc where (if the partition set is non-maximal in but maximal in the full matroid). In either case the existence of this arc contradicts the assumed construction of the set , and the contradiction proves that each partition set is maximal. Thus, by the easy direction of the matroid partitioning formula, the number of sets needed to partition is at least
,

so in this case the algorithm may find an optimal partition by placing into its own new independent set and leaving the other independent sets unchanged.[1][3]

The overall algorithm, then, considers each element of the given matroid in turn, constructs the graph , tests which nodes can reach , and uses this information to update the current partition so that it includes . At each step, the partition of the elements considered so far is optimal, so when the algorithm terminates it will have found an optimal partition for the whole matroid. Proving that this algorithm is correct requires showing that a shorcut-free path in the auxiliary graph always describes a sequence of operations that, when performed simultaneously, correctly preserves the independence of the sets in the partition; a proof of this fact was given by Edmonds. Because the algorithm only increases the number of sets in the partition when the matroid partitioning formula shows that a larger number is needed, the correctness of this algorithm also shows the correctness of the formula.[1][3]

Although this algorithm depends only on the existence of an independence oracle for its correctness, faster algorithms can be found in many cases by taking advantage of the more specialized structure of specific types of matroids (such as graphic matroids) from which a particular partitioning problem has been defined.[4]

A matroid sum (where each is a matroid) is itself a matroid, having as its elements the union of the elements of the summands. A set is independent in the sum if it can be partitioned into sets that are independent within each summand. The matroid partitioning algorithm generalizes to the problem of testing whether a set is independent in a matroid sum. Its correctness can be used to prove that a matroid sum is necessarily a matroid.[3][4] An extended problem, that is also sometimes called matroid partition, is to find a largest set that is independent in the matroid sum, that is, a largest set that can be partitioned into sets that are disjoint in each input matroid. Cunningham[5] presents an algorithm for solving this problem on O(n) n-element matroids using calls to an independence oracle.

The matroid intersection problem is finding the largest set that is independent in two matroids and . It may be solved by turning it into an equivalent matroid sum problem: if is a basis of the sum , where is the dual of , then must have full rank in and removing a maximal independent set of from leaves a maximum intersection.[6]

Matroid partitioning is a form of set cover problem, and the corresponding set packing problem (find a maximum number of disjoint spanning sets within a given matroid) is also of interest. It can be solved by algorithms similar to those for matroid partitioning.[4] The fractional set packing and set covering problems associated with a matroid (that is, assign a weight to each independent set in such a way that for every element the total weight of the sets containing it is at most one or at least one, maximizing or minimizing the total weight of all the sets, respectively) can also be solved in polynomial time using matroid partitioning methods.[1]

As well as its use in calculating the arboricity of a graph, matroid partitioning can be used with other matroids to find a subgraph of a given graph whose average degree is maximum, and to find the edge toughness of a graph (a variant of graph toughness involving the deletion of edges in place of vertices).[1]

Matroid-constrained number partitioning is a different problem in which k (the number of subsets in the partition) is fixed. There are k different matroids over the same ground set, and the goal is to partition the ground set into k subsets, such that each subset i is an independent set in matroid i. Subject to this constraint, some objective function should be minimized. In a generalization of this variant, each of the k matroids has a weight, and the objective function depends on the weights (maximum weight, minimum weight or sum of weights).

References

  1. ^ a b c d e f g h Scheinerman, Edward R.; Ullman, Daniel H. (1997), "5. Fractional arboricity and matroid methods", Fractional graph theory, Wiley-Interscience Series in Discrete Mathematics and Optimization, New York: John Wiley & Sons Inc., pp. 99–126, ISBN 0-471-17864-0, MR 1481157.
  2. ^ Nash-Williams, C. St. J. A. (1964), "Decomposition of finite graphs into forests", Journal of the London Mathematical Society, 39 (1): 12, doi:10.1112/jlms/s1-39.1.12, MR 0161333.
  3. ^ a b c d e f Edmonds, Jack (1965), "Minimum partition of a matroid into independent subsets" (PDF), Journal of Research of the National Bureau of Standards, 69B: 67–72, doi:10.6028/jres.069b.004, MR 0190025.
  4. ^ a b c Gabow, Harold N.; Westermann, Herbert H. (1992), "Forests, frames, and games: algorithms for matroid sums and applications", Algorithmica, 7 (5–6): 465–497, doi:10.1007/BF01758774, MR 1154585.
  5. ^ Cunningham, William H. (1986-11-01). "Improved Bounds for Matroid Partition and Intersection Algorithms". SIAM Journal on Computing. 15 (4): 948–957. doi:10.1137/0215066. ISSN 0097-5397.
  6. ^ Edmonds, Jack (1970), "Submodular functions, matroids, and certain polyhedra", Combinatorial Structures and their Applications (Proc. Calgary Internat. Conf., Calgary, Alta., 1969), New York: Gordon and Breach, pp. 69–87, MR 0270945.