Shape graphs and the instantaneous inference of tactical positions in soccer

Introduction

Association football (soccer) is a territorial invasion game in which simple rules lead to complex collective behavior [1]. Consequently, spatial positioning is of paramount importance in football tactics [2]. To be able to coordinate quickly and with limited communication, players assume situational responsibilities for opponents and areas of the pitch. Conventional means to describe such collective behavior include tactical positions with labels such as ‘left back’ or ‘center forward,’ and team formations with labels such as ‘4-4-2’ or ‘3-4-3,’ summarizing a split of outfield players into backs, midfielders, and forwards [3].

However, these descriptors are largely static and decontextualized. Players’ spatial responsibilities generally differ between match phases and even within possession sequences. They are moderated by opponent behavior, and may change temporarily or permanently following the demands of an unfolding match. Although these dynamics are widely acknowledged, media and match reports commonly content themselves with the average locations of players [4].

We here propose a novel framework to analyze spatial arrangements instantaneously, i.e., at the temporal resolution at which tracking data is available and without using information from other moments in time. Tracking data is utilized at an increasing rate in a variety of team sports [5], including football [6,7]. Our framework integrates and extends previous ideas to describe the dynamic spatial arrangements of teams. Instead of aggregating player locations over periods of time, we analyze each data frame separately to generate time series at the original temporal resolution.
From a proximity network imposed onto the set of player locations, we are able to derive tactical positions, team formations, and other aspects of interest for each snapshot and therefore without losing temporal contexts such as match phases.

Our two main contributions are as follows.

Shape graphs: The shape of a team’s spatial organization is represented as a combination of structure (networks) and geometry (space) by augmenting each frame of spatial tracking data with a novel kind of proximity network.

Instantaneous tactical positions: By inferring tactical positions from a single shape graph, each snapshot is interpreted separately. This yields instantaneous positions that facilitate the exploration of dynamic positioning in a novel visualization of such high-frequency time series of spatial information.

After introducing these concepts in the next two sections, we demonstrate their use on publicly available tracking data.

Team shape

To characterize a team’s dynamic spatial arrangement in terms of tactically relevant features such as compactness or defensive line height, it is common to derive time series of geometric descriptors such as centroids, lengths, and areas from tracking data [8,9]. While these are determined on a frame-by-frame basis, more complex features such as player activity or team formations are typically determined from data aggregated over sequences of frames [10,11,12,13,14]. We combine these two approaches and generate high-frequency time series of complex features.

Since players are trained to use other players as spatial references, proximity relations are a natural starting point for the definition of team shape. Delaunay triangulations capture many of these relations in a single structure [15] and are therefore used extensively for shape representation in football [13,14,16], computer graphics, biometry, and many other domains.

A Delaunay triangulation is a planar graph defined on a point set in the plane as exemplified in Fig. 1a.
It can be determined efficiently and a characteristic feature is that no circumcircle of a triangle encloses another point. Its graph contains, for instance, the convex hull of the point set, pairwise nearest neighbor relations, and a geometric minimum spanning tree. It also maximizes the minimum angle between incident edges across all triangulations of the point set. Delaunay triangulations are dual to Voronoi diagrams, which were among the first techniques to model what would later be called pitch control [17,18,19,20].

Fig. 1: Representing the spatial arrangement of a team’s ten outfield players in a 4-4-2 formation. We define a shape graph (b) as a subgraph of the Delaunay triangulation (a) that is more stable against shape-preserving perturbations (c). Note that small movements suffice to cause structural changes (dashed) in the Delaunay triangulation and its dual Voronoi diagram (gray).

Despite their many desirable properties, Delaunay triangulations also have disadvantages, especially in the context of moving objects. As shown in Fig. 2a, alignments of points can lead to ambiguities and thus sensitivity to perturbation.

Fig. 2: Co-circular and near-degenerate point configurations. a For four points on an otherwise empty circle with center c, either diagonal can be in the Delaunay triangulation. b After translating \(r \to r'\), only pq is a Delaunay edge, because the two circles defined by p, q and one of \(r', s\) do not enclose the remaining point, whereas those defined by \(r', s\) and one of p, q do. The two pairs of circumcenters are necessarily separated by the same angle α.

Among the variety of approaches to measure sensitivity and cope with perturbations [21,22], we found a scale-invariant measure of angular stability [23] particularly suitable for our context. As depicted in Fig.
2b, it is defined as the angle formed by an endpoint of a Delaunay edge and the circumcenters of its two incident triangles. Since circumcenters are the vertices of the dual Voronoi diagram, the angle corresponds to the lengths of dual Voronoi edges in Fig. 1c. Removing all edges with angular stability below some angle α yields a so-called α-stable Delaunay graph. This subgraph, however, is often too sparse, because the joint removal of multiple edges with low stability may reduce connectivity, yield non-convex faces, and thus lead to shape descriptions that do not match the assumptions about spatial orientation of players.

We therefore introduce a new kind of Delaunay subgraph, which we refer to as a shape graph. A straightforward generalization of angular stability to non-triangular faces allows us to remove a least stable edge, update the stability of all edges in the newly created face, and iterate until no unstable edges remain. We use \(\alpha = \frac{\pi}{4} = 45^{\circ}\) as the threshold and provide computational details in the section on Methods.

Figure 1c illustrates that our shape graphs are less sensitive to shape-preserving movement. Although we here consider only the outfield players of one team in our examples, the approach extends naturally to representations including the goalkeeper, the ball, or the opposition team as additional points.

In the next section, we show how to exploit shape graphs to infer tactical positions of players.

Instantaneous tactical positions

We reserve the term position for the tactical concept, and refer to a point with coordinates in space as a location. For the purpose of this contribution, we further distinguish between player positions as spatially, and player roles as behaviorally defined concepts. However, we do not aim for a unified terminology, because stakeholders such as coaching staff, data providers, and the media all have their own nomenclature for tactical positions, often mixing positions with roles.
We do attempt to stay consistent with the most common terms, however.

A position such as left center back or central attacking midfielder describes where a player is generally expected to play in relation to his or her teammates. A role such as support striker or playmaker describes the actions expected of a player, which may differ even when the player appears in similar locations. Moreover, particular roles such as wing back or false nine can be viewed as combinations of positions assumed in different situations. Note that this differs from the terminology of Bialkowski et al. [10] and subsequent work, where the term role is used for a predominantly spatial concept.

We propose a simple method to infer positions from shape graphs, which by their very definition ease a number of tasks:

Normalization: The structure of shape graphs is, by definition, invariant under translation and scaling, and therefore robust against collective movements such as shifting, contraction, and expansion that teams use to cover space efficiently. This invariance eliminates the need for transformations common in other approaches [10,11,24].

Identity: When determined separately for each frame of tracking data, the structure and geometry of shape graphs are independent of player identities. Since we infer positions from the geometric graph only, with no regard for who a node represents, no mutual constitution of players and positions through matching [10,12] is needed.

Orientation: Finally, the space we are concerned with is oriented, from goal line to goal line and from touchline to touchline. Following the logic behind common terminology, and thus to avoid cognitive dissonance, we use the convention of pitches oriented vertically, with the focal team playing upward.
This terminology also suggests that the assignment of position labels can be separated into a horizontal (left, central, right) and a vertical (backs, midfielders, forwards) component.

For all these advantages, our method is rather simple (see Fig. 3): Splits are determined independent of scale by using the centers of mass for each internal face as references. The two extreme centers define the dashed horizontal levels at which players are separated in Fig. 3. The procedure is applied analogously to the horizontal component, and our handling of edge cases is described in the Methods section.

Fig. 3: Decomposition of a shape graph into vertical positions. Centers of faces (small circles) serve as reference for splitting off the upper and lower parts of the convex hull, and the procedure is repeated once on the remaining midfielders.

As a result, we obtain up to five ordinal levels horizontally and vertically, or a 5 × 5-matrix of relative positions, which may be labeled, for instance, as shown in Fig. 4.

Fig. 4: Player positions are determined from the shape graph of each individual tracking data frame. a Based on structure and geometry, players are assigned color-coded vertical and horizontal levels. b Their combination is translated into a tactical position according to a matrix template. Because of their typical positioning patterns, left and right full backs (LB, RB) and wide forwards (LF, RF) are expected to be found in either of two matrix cells.

Results

The purpose of this section is two-fold: first, we aim to build trust in our method to detect meaningful positions, and second, we exemplify how it can be used in match and opposition analysis.
For concreteness and replicability, we present results obtained for some publicly available tracking datasets.

Validity of position labels

There is no single concept of tactical position, as many coaches and analysts have their own approaches, players interpret positions differently, and the fluidity of the game leads to varying observable outcomes even if the underlying principles are the same. Hence, differences between any two means of assigning positions are inevitable.

The quantitative validation of our shape-graph derived labeling should therefore not be understood as an attempt to measure the accuracy of predictions, but as a demonstration that consistently assigned labels from a major commercial data provider yield consistent and distinguishable patterns of labels in our method.

Bassek et al. [25] recently published data from seven matches of the 2022/2023 season in the first and second division of German professional men’s football. For each of these matches, the data include optically tracked locations at 25 frames per second (roughly 143,000 frames per match) and provider-assigned position labels as shown in Fig. 5a. To reduce noise and small-sample effects, we double the size of the data as follows. Assuming that there should be no systematic bias in lateral positioning, we add a copy of each frame mirrored across the longitudinal axis (from goal to goal), with players relabeled accordingly (LM to RM, DMR to DML, etc.). This yields between three and five match occurrences for positions LA/LR, LM/RM, DML/DMR, IVZ, and DMZ, and at least 16 for all others (except HST and MZ, which are not present in the data). Moreover, we only consider the 176 players who started a match or were substituted into the game no later than the 75th minute.

Fig. 5: Comparison with position labels in the provider data. Position labels reproduced from Bassek et al. [25] (a) and the relative frequency by which our method assigns players with these labels to a cell in the 5 × 5 position matrix (b).
No player in the data set is labeled HST (support striker) or MZ (central midfielder), but additional labels DLM, DRM (defensive left/right midfielder) are present.

For about one million frames during which the ball is in play we determine our instantaneous horizontal and vertical levels. Level combinations of players with the same label in the data set are aggregated. After normalization, we obtain relative frequencies by which players with the same given label are assigned to each of the 25 matrix cells. Results are depicted in Fig. 5b, where the area of each disc is proportional to the share of frames in which the corresponding levels were assigned.

The evident patterns in Fig. 5b suggest that the positioning of players relative to their teammates as determined by our method is strongly associated with the position labels provided in the data. To test the association further, these distributions are used as prototypes for a simple classifier. For each of the seven matches we classify every player who featured by the 75th minute. If \(Q = (q_{ij})_{1 \le i,j \le 5}\) is the distribution of level combinations for any one appearance of a player, it is assigned to the prototype \(P = (p_{ij})_{1 \le i,j \le 5}\) from Fig. 5b that is most similar according to the Bhattacharyya coefficient \(BC(P,Q) = \sum_{i=1}^{5} \sum_{j=1}^{5} \sqrt{p_{ij} q_{ij}}\). Figure 6a confirms that this classification matches the labels provided in the data in more than two-thirds of the cases (122 out of 176). Moreover, a metric multi-dimensional scaling of Hellinger distances \(\sqrt{1 - BC(P,Q)}\) between positional distributions suggests that they cluster well and in alignment with given position labels.

Fig. 6: Comparison of given position labels with our position distributions for 176 player-match instances. a A classifier assigning the nearest prototype from Fig. 5b largely recovers the given position labels.
b The scatter plot is a two-dimensional representation of similarity among position distributions associated with given position labels. Circled outliers are discussed in Fig. 7.

Inspection of mismatches and clustering outliers revealed a few common scenarios suggesting that it is often the labels provided in the data, which are assigned before the start of the match [25], that are implausible.

Players expected to be fielded on one side actually play on the other side. An example shown in Fig. 7a is Lukas Schleimer of 1. FC Nürnberg in a match against Fortuna Düsseldorf on October 15, 2022. Although commonly a striker and labeled as right center forward (STR), he actually played on the left in a 4-2-2-2 after coming on in the 65th minute.

Fig. 7: Examples of mismatches. Position distributions of the examples highlighted in Fig. 6b suggest that their labels provided with the tracking data are not suitable. While a is simply a misclassification, b results from a position change during the match, and c from the overlay of in- and out-of-possession phases.

Positions may change during a match, which necessarily yields distributions that do not fall squarely into a single category. The example in Fig. 7b is Edmond Tapsoba of Bayer Leverkusen in a match against VfL Bochum on May 27, 2023. Labeled and starting as a left center back (IVL), he moves to the right side when the team changes from four to three at the back at half-time.

Positioning may depend on ball possession. The example in Fig. 7c is Dawid Kownacki of Fortuna Düsseldorf in a match against Hansa Rostock on September 10, 2022. Labeled as the center forward (STZ) in a 4-2-3-1 formation, he tended more to the left when his team was out of possession.

Match analysis example

For an example analysis of position dynamics, we use better known data from a match that has been available for some time.
It is the most recent in a small compilation published by Metrica [26], and contains anonymized event and tracking data for an unspecified match referred to as Sample Game 3. The data was collected at a rate of 25 frames per second for a total of 143,761 frames (close to 96 min), each containing location information for all players and the ball.

We used Python package kloppy v3.15 [27] to infer times during which the ball is not in play from the event data. After dropping the corresponding frames from the tracking data, T = 117,448 frames remained.

Shape graphs were computed separately for the outfield players of each team in each frame. Frequent shape graph edges shown in Fig. 8 are restricted to the first half, because substitutions and tactical changes discussed below lead to a mix of principles in the second half.

Fig. 8: Frequent shape graph edges. Edges appearing at least 20,000 times (i.e., in more than one third of all frames) during the first half of the Sample Game 3. Thicker and darker edges appear more often, with the maxima for both teams above 50,000 times. Players are placed in their average locations. a A single center forward and few edges between wingers indicate stricter positioning. b Four attackers with close average positions and many edges represent a more varied approach.

Both teams play with a back four (four defenders at the back) and a double pivot (two midfielders in front of the back four). Their higher degree in the graph of frequent proximity edges suggests that Players #6 (Team A) and #24 (Team B) cover a substantially larger vertical range than their partnering defensive midfielders #8 and #23, who are in corresponding average locations but seem to play different roles.

Average locations of Team B give the false impression of a compact attack through the center, which seems different from the wide locations of #5 and #9 in Team A.
Since the individual shape graphs are planar, however, crossing proximity edges already hint at frequent lateral movement. Since it is not possible to infer the actual dynamics from such aggregate representations, we next turn to time series of positions.

Using the color scheme from Fig. 4, we visualize the ordinal level pairs for all players as bivariate time series in Fig. 9. For readability and resolution, we aggregate over intervals of six seconds (150 frames) and display the modal level (most frequent color) in each time interval.

Fig. 9: Position time series over the entire match. Color pairs represent vertical (upper half of each row) and horizontal (lower half) levels. Gaps indicate substitutions in the team, with dots linking the players involved.

These position plots are inspired by visualization designs for collective movement [28], and since the colors have an interpretation that persists across teams and matches, they represent more information than seemingly similar depictions of cluster membership in related work [10,11,14].

In Team A, for instance, Player #10 is recognized as a center forward being almost always highest up the pitch (mostly red in the top row) and rarely moving wide of other players (only light colors in the bottom row). Player #12, on the other hand, replaces right center back (RCB) #2 at half-time and plays in the same position until right back (RB) #1 is replaced by another RCB, #17, and #12 moves to RB instead. Categorizing all other players in the same way, we conclude that Team A play out of a 4-2-3-1 formation. With respect to the aggregate plots of Fig. 8, we observe that the wide players rarely move to the other side.

Team B, on the other hand, plays out of a 4-4-2 formation in which the two center forwards and wingers switch sides often. This explains the seemingly compact attacking formation in Fig.
8b, because player rotations visible from the position plots result in more central average locations.

An interesting change of tactics can be observed by combining the two charts. When Team A introduce #16 as LDM for AM #7, #14 rotates right, #5 into the center, and #13, having already played more advanced than #8 after coming on at half-time, moves to LF. It appears that Team B respond by replacing a CF with RB #34 and moving #29 into positions between the new RB and RM #30. During this final stretch of the match, Team B appear to abandon the 4-4-2 and free up #29 for man-marking #13.

In addition to general trends, coordinated exceptions become apparent. We have already noted that the center forwards of Team B are swapping sides frequently, whereas the wide forwards of Team A are not. A temporary change involving more players can be observed when the center backs of Team A exchange vertical positions with midfielders early in each period, likely for a corner kick or a free kick in the opposition half. Similarly, after conceding the second goal, both the full backs (LB #4, RB #1) and wide forwards (LF #9, RF #5) of Team A drop deeper until the next substitution is made.

Position plots allow for much more detailed analysis when augmented with match phases, main events, ball location, and other information. Such extensions, however, are beyond the scope of this paper.

Discussion

Shape graphs represent simultaneously the structure and geometry of relative positions and dispense with the need to normalize player locations by translation or scaling. They capture meaningful proximity relationships because players are trained to make use of four different types of spatial references: their teammates, their opponents, the ball, and the pitch [29]. Although the shape graphs presented here make use of the first type only, inclusion of additional references is straightforward.
Shape graphs defined on the locations of all outfield players in both teams, for example, would allow for the derivation of time series that represent marking and pressing dynamics [30,31,32].

Delaunay triangulations of moving points change when passing through ambiguous configurations of zero angular stability, and our criterion for preemptive removal of edges with low angular stability shifts changes to moments in which a third point is close enough to challenge the existence of an edge representing mutual spatial referencing of two players.

As our approach is computationally efficient and instantaneous, it is suitable for use with live tracking data. Future work could include the storage of shape graphs in kinetic data structures [33] for efficiency, and in graph databases for querying moments of interest [34,35].

By deriving them instantaneously from shape graphs, we have separated the detection of positions from their matching with players. Momentary tactical positions of players are inferred from their relative spatial locations. It is therefore not necessary to assume team-level persistence of the set of positions over any period of time. In fact, previous work can be viewed as focusing on the identification of spatial arrangements that are stable over periods of time, and therefore interested just as much in temporal segmentation as in positions and formations. With the template-matching approach of Müller-Budack et al. [12] a notable exception, position labels are generally assigned by hand after positions have been determined.

Our assignment is systematic and interpretable; and it may be applied to any configuration, including the output of aggregation approaches. After the first vertical split in the example of Fig. 10, the shape graph of the four midfielders forms a horizontal path, so that no further distinction takes place.
In the horizontal partition, inside players are split equidistantly (see Methods section), but the center forward is close to being interpreted as another left center forward.

Fig. 10: Automated labeling of aggregate configurations. Bauer et al. [38] (their Fig. 3) approximate distributions of normalized locations during match phases. With shape graphs defined on their centroids, position labels can be assigned automatically.

On a temporal scale, the tactical tweaks pointed out in the Results section are fairly coarse grained. The time a team in transition needs to return to its default shape and frequent patterns of positional changes are but two examples of interesting analyses at higher temporal resolution. Instantaneous positions may further be used to contextualize individual events. Passes, for instance, are often labeled with time, players involved, their locations, a pass type, and an outcome, but could be augmented with the tactical positions at this moment.

The position plots in Fig. 9 are but one example of how to visualize position time series. They can be varied, for instance, by filtering, by changing the level of resolution, and by annotation with further match events. The latter may include bookings, match phases, momentum, or any other information common for match time-lines and potentially driving positional changes or providing situational context. Our example should therefore not be considered a specific chart design, but a framework to construct static depictions of positional dynamics.

Methods

To complement the introduction of shape graphs and instantaneous positions, we outline in this section some additional details relevant for their implementation.

Shape graphs

Assume we are given a sequence of tracking data frames \(P^{(t)} = \{p_1^{(t)}, \ldots, p_n^{(t)}\}\) containing two-dimensional spatial locations \(p_i^{(t)} \in \mathbb{R}^2\) for n objects at discrete time steps t = 1, …, T.
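As a rough illustration of this input, the following sketch stores each frame as a plain list of (x, y) tuples and evaluates a descriptor on every frame separately, which yields one value per time step. The toy coordinates and the names `frames`, `centroid`, and `frame_series` are hypothetical and chosen for illustration only; they are not taken from the datasets used here.

```python
from statistics import mean

# A frame is a list of n (x, y) locations; a match is a list of T frames.
# Toy data with n = 3 points and T = 2 frames (illustrative values only).
frames = [
    [(0.0, 0.0), (10.0, 0.0), (5.0, 9.0)],
    [(2.0, 1.0), (12.0, 1.0), (7.0, 10.0)],
]


def centroid(frame):
    """Centroid of all locations in a single frame."""
    return (mean(x for x, _ in frame), mean(y for _, y in frame))


def frame_series(frames, descriptor):
    """Evaluate `descriptor` on every frame separately, one value per time step."""
    return [descriptor(frame) for frame in frames]


centroids = frame_series(frames, centroid)
print(centroids)
```

Any other per-frame function, including one that builds a shape graph, could be passed in place of `centroid` without using information from neighboring frames.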
In the present scenario with 25 Hz tracking and a focus on the outfield players, we have T ≈ 140,000, and n = 10 for each team.

Given a finite point set \(P \subset \mathbb{R}^2\) in general position, a Delaunay triangulation is a planar graph D(P) = (P, E) such that every internal face is a triangle whose circumcircle does not enclose another point of P. The circumcenters of those triangles are the vertices of the dual Voronoi diagram.

For p, q ∈ P, consider the oriented line \(\vec{pq}\) through p and q, and let \(P_{pq} = \{r \in P \setminus \{p, q\} : (q - p) \times (r - p)\) …

… 135°. b For an internal edge adjacent to a face that is no longer a triangle, the removal condition has to be fulfilled with all of its corners, i.e., for \(\min_{i=1,\ldots,k} \angle p r_i q\) replacing ∠prq. c For an edge on the outer face, the outside angle is set to 0°; the removal condition becomes ∠psq > 135° or, equivalently, s being inside the gray area.

Algorithm 1: Shape graph

The shape graph of a set of n points is determined by Algorithm 1, where angles associated with edges in a newly merged face are updated after each removal. Worst-case running time is therefore \(\mathcal{O}(n^2)\), as evidenced by any convex point set P, but in applications to football we typically have n ≤ 23 (all players and the ball) and only few removals.

While essentially absent from match data, ties in least angular stability do occur in textbook examples of spatial arrangements. If two edges are tied for minimum stability and the removal of one leads to a stability update that lets us keep the other edge, we in fact keep both to maintain symmetry.

It is important to note that shape graphs are geometric graphs [36], i.e., combinations of network structures and spatial information.
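The removal test applied in each iteration can be sketched as follows. The sketch assumes that the angular stability of an edge pq equals π minus the two angles under which the adjacent faces see the edge, which is consistent with the removal conditions stated above for α = 45° (opposite angles summing to more than 135°, the minimum over all corners for a non-triangular face, and an outside angle of 0° for the outer face). All function names are hypothetical, and the bookkeeping of merged faces after each removal is omitted.

```python
import math

ALPHA = math.pi / 4  # stability threshold of 45 degrees, as used in the text


def angle_at(corner, p, q):
    """Return the angle p-corner-q (in radians) at the point `corner`."""
    ax, ay = p[0] - corner[0], p[1] - corner[1]
    bx, by = q[0] - corner[0], q[1] - corner[1]
    # atan2 of cross and dot product is numerically safer than acos.
    return abs(math.atan2(ax * by - ay * bx, ax * bx + ay * by))


def face_angle(p, q, corners):
    """Smallest angle under which the corners of one adjacent face see edge pq.

    `corners` lists the remaining corners of that face. An empty list stands
    for the outer face, whose angle is set to 0.
    """
    return min((angle_at(r, p, q) for r in corners), default=0.0)


def angular_stability(p, q, corners_a, corners_b):
    """Assumed stability: pi minus the angles of the two adjacent faces."""
    return math.pi - face_angle(p, q, corners_a) - face_angle(p, q, corners_b)


def is_unstable(p, q, corners_a, corners_b, alpha=ALPHA):
    """Removal condition: the two face angles sum to more than pi - alpha."""
    return angular_stability(p, q, corners_a, corners_b) < alpha


# Diagonal of a unit square: the four points are co-circular, stability is 0.
print(is_unstable((0, 0), (1, 1), [(1, 0)], [(0, 1)]))  # True
# Hull edge with a point close to it: outside angle 0, inside angle above 135 degrees.
print(is_unstable((0, 0), (2, 0), [], [(1, 0.2)]))  # True
# Short edge between two distant opposite points: clearly stable.
print(is_unstable((0, 0), (1, 0), [(0.5, 2)], [(0.5, -2)]))  # False
```

A full implementation would repeatedly remove a least stable edge, merge its two faces, and recompute the stability of the edges bounding the merged face until `is_unstable` holds for no remaining edge.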
In addition to the combinatorial structure of player proximity, shape graphs also represent geometric properties of that structure, and therefore quantitative spatial information that can be organized in variables associated with the structural elements. Examples include the area of faces or the orientation of edges relative to the direction of play. This representation is lossless, and since the additional memory needed for its storage is linear in the number of points, the overhead is negligible.

Tactical positions

Splits obtained with face centers as reference are tolerant to shape changes in other parts of the team. The latter present difficulties for approaches based on pairwise distances and clustering [24,37].

When multiple players align on circles or straight lines, however, shape graphs may contain too few faces and potentially even bridging edges in the outer face. To ensure robust and meaningful assignment of positions, the following exceptions are introduced in our two-tier decomposition from above:

If the shape graph contains steep bridging edges, the edges themselves are treated as faces, so that their center is in the middle of the edge. The slope of the bridging edge in Fig. 12 aligns with the vertical partition in 12a, but defines the threshold for the first horizontal split in 12b.

Fig. 12: Special cases in the assignment of position levels. a Centered split. b Equidistant secondary split.

If a group of at least four vertices is maximally imbalanced, i.e., divided into a singleton, an empty center, and the rest, then the large group is assigned to the central partition. Otherwise, a configuration such as the one in Fig. 12a, which resembles the midfield of a 4-1-4-1 formation, would have four attacking midfielders.

If all centers of internal faces and at least one vertex are in the middle third between the highest and lowest vertex, we split into equal thirds instead. See the secondary split in Fig.
12b for example.

Data availability

The data used in this article was made publicly available in Metrica (2021) and Bassek et al. (2025).

References

1. Teoldo, I., Guilherme, J. & Garganta, J. Football Intelligence: Training and Tactics for Soccer Success (Taylor & Francis, 2022).
2. Wilson, J. Inverting the Pyramid: The History of Soccer Tactics (Nation Books, 2013).
3. Sotudeh, H. The principles of tactical formation identification in association football (soccer) – a survey. Front. Sports Active Living 6, 1512386 (2025).
4. Whitmore, J. & Seidl, T. Shape analysis: automatically detecting formations. https://web.archive.org/web/20210301192414/https://www.statsperform.com/resource/shape-analysis-automatically-detecting-formations/ (2021).
5. Torres-Ronda, L., Beanland, E., Whitehead, S., Sweeting, A. & Clubb, J. Tracking systems in team sports: a narrative review of applications of the data and sport specific analysis. Sports Med. Open 8, 15 (2022).
6. Memmert, D. & Raabe, D. Data Analytics in Football: Positional Data Collection, Modelling and Analysis (Taylor & Francis, 2024).
7. Andrienko, G. et al. Constructing spaces and times for tactical analysis in football. IEEE Trans. Visual. Comput. Graph. 27, 2280–2297 (2021).
8. Bartlett, R., Button, C., Robins, M., Dutt-Mazumder, A. & Kennedy, G. Analysing team coordination patterns from player movement trajectories in soccer: methodological considerations. Int. J. Perform. Anal. Sport 12, 398–424 (2012).
9. Low, B. et al. A systematic review of collective tactical behaviours in football using positional data. Sports Med. 50, 343–385 (2020).
10. Bialkowski, A. et al. Large-scale analysis of soccer matches using spatiotemporal tracking data. In IEEE International Conference on Data Mining, 725–730 (Institute of Electrical and Electronics Engineers, 2014).
11. Shaw, L. & Glickman, M. Dynamic analysis of team strategy in professional football. In Barça Sports Analytics Summit (Barça Innovation Hub, 2019). https://barcainnovationhub.com/event/barca-sports-analytics-summit-2019/.
12. Müller-Budack, E., Theiner, J., Rein, R. & Ewerth, R. “Does 4-4-2 exist?”: an analytics approach to understand and classify football team formations in single match situations. In Proc. 2nd International Workshop on Multimedia Content Analysis in Sports, MMSports ’19, 25–33 (Association for Computing Machinery, 2019).
13. Narizuka, T. & Yamazaki, Y. Clustering algorithm for formations in football games. Sci. Rep. 9, 13172 (2019).
14. Kim, H., Kim, B., Chung, D., Yoon, J. & Ko, S.-K. SoccerCPD: formation and role change-point detection in soccer matches using spatiotemporal tracking data. In Proc. 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, KDD ’22, 3146–3156 (Association for Computing Machinery, 2022).
15. Aurenhammer, F., Klein, R. & Lee, D.-T. Voronoi Diagrams and Delaunay Triangulations (World Scientific, 2013).
16. Raabe, D., Biermann, H., Bassek, M., Memmert, D. & Rein, R. The dual problem of space: Relative player positioning determines attacking success in elite men’s football. J. Sports Sci. 42, 1821–1830 (2024).
17. Kawashima, Yoshino & Aoki. Qualitative image analysis of group behaviour. In 1994 Proc. IEEE Conference on Computer Vision and Pattern Recognition, 690–693 (Institute of Electrical and Electronics Engineers, 1994).
18. Taki, T., Hasegawa, J. & Fukumura, T. Development of motion analysis system for quantitative evaluation of teamwork in soccer games. In Proc. 3rd IEEE International Conference on Image Processing, Vol. 3, 815–818 (Institute of Electrical and Electronics Engineers, 1996).
19. Spearman, W. Quantifying pitch control. In OptaPro Analytics Forum. https://doi.org/10.13140/RG.2.2.22551.93603 (2016).
20. Fernández, J. & Bornn, L. Wide open spaces: a statistical technique for measuring space creation in professional soccer. In MIT Sloan Sports Analytics Conference (MIT Sloan School of Management, 2018). https://www.sloansportsconference.com/research-papers/wide-open-spaces-a-statistical-technique-for-measuring-space-creation-in-professional-soccer.
21. Abellanas, M., Hurtado, F. & Ramos, P. A. Structural tolerance and Delaunay triangulation. Inform. Process. Lett. 71, 221–227 (1999).
22. Liang, X., Bishnu, A. & Asano, T. A robust fingerprint indexing scheme using minutia neighborhood structure and low-order Delaunay triangles. IEEE Trans. Inform. Forensics Secur. 2, 721–733 (2007).
23. Agarwal, P. K. et al. Stable Delaunay graphs. Discrete Comput. Geometry 54, 905–929 (2015).
24. FIFA High Performance, T. S. G. Enhanced Football Intelligence. https://www.fifatrainingcentre.com/media/native/world-cup-2022/Enhanced%20Football%20Intelligence%20EN.pdf (2022).
25. Bassek, M., Rein, R., Weber, H. & Memmert, D. An integrated dataset of spatiotemporal and event data in elite soccer. Sci. Data 12, 195 (2025).
26. Metrica. Metrica Sports Sample Data (2021). https://github.com/metrica-sports/sample-data/tree/master/data/Sample_Game_3.
27. PySport. kloppy: standardizing soccer tracking and event data. https://kloppy.pysport.org/ (2024).
28. Buchmüller, J. F., Schlegel, U., Cakmak, E., Keim, D. A. & Dimara, E. SpatialRugs: a compact visualization of space and time for analyzing collective movement data. Comput. Graph. 101, 23–34 (2021).
29. Escher, T. Der Schlüssel zum Spiel: wie moderner Fußball funktioniert (Rowohlt Taschenbuch Verlag, 2020).
30. Buldú, J. M. et al. Football tracking networks: Beyond event-based connectivity. In Analytics in Sports Tomorrow (Barça Innovation Hub, 2020). https://arxiv.org/abs/2011.06014.
31. Chacoma, A., Billoni, O. V. & Kuperman, M. N. Complexity emerges in measures of the marking dynamics in football games. Phys. Rev. E 106, 044308 (2022).
32. Andrienko, G. et al. Visual analysis of pressure in football. Data Mining and Knowledge Discovery 31, 1793–1839 (2017).
33. Guibas, L. Kinetic Data Structures. Handbook of Data Structures and Applications, 377–388 (Chapman and Hall/CRC, 2017).
34. Sha, L. et al. Chalkboarding: a new spatiotemporal query paradigm for sports play retrieval. In Proc. 21st International Conference on Intelligent User Interfaces, IUI ’16, 336–347 (Association for Computing Machinery, 2016).
35. Stein, M. et al. From movement to events: improving soccer match annotations. In Kompatsiaris, I. et al. (eds.) MultiMedia Modeling, Lecture Notes in Computer Science, 130–142 (Springer International Publishing, 2019).
36. Pach, J. Geometric Graph Theory. In Preece, D. A. & Lamb, J. D. (eds.) Surveys in Combinatorics, 1999, vol. 267 of London Mathematical Society Lecture Note Series, 167–200 (Cambridge University Press, 1999).
37. Fernández de la Rosa, J. A framework for the analytical and visual interpretation of complex spatiotemporal dynamics in soccer. Ph.D. Thesis, Universitat Politécnica de Catalunya. http://www.tdx.cat/handle/10803/673529 (2022).
38. Bauer, P., Anzer, G. & Shaw, L. Putting team formations in association football into context. J. Sports Anal. 9, 39–59 (2023).

Acknowledgements

Our method was developed on data provided by UEFA and FIFA for research purposes. We have benefited from feedback and discussions with professional match analysts, most notably Kevin Ehmes, Timo Gross, and Yannick Herkommer.
Two reviewers provided constructive feedback that helped improve both content and exposition.

Funding

Open access funding provided by Swiss Federal Institute of Technology Zurich.

Author information

Authors and Affiliations

Social Networks Lab, ETH Zurich, Zurich, Switzerland
Ulrik Brandes, Hadi Sotudeh, Doğan Parlak, Paolo Laffranchi & Mert Erkul

Contributions

U.B. conceptualized the research. All authors contributed to the specifics of the methods and implemented various tests. H.S. implemented a consolidated version and produced the results. U.B. wrote the manuscript text and prepared the figures; U.B. and H.S. revised the manuscript.

Corresponding author

Correspondence to Ulrik Brandes.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.