STAMP: A shot-type-aware areal multilevel Poisson model for league-wide comparison of basketball shot charts

ST AMP: A shot-t yp e-a w are areal m ultilev el P oisson mo del for league-wide comparison of bask etball shot c harts Kazuhiro Y amada 1 and Keisuk e F ujii 1,2* 1 Graduate Sc ho ol of Informatics, Nago ya Univ ersit y , Nagoy a, Japan. 2 Cen ter for Adv anced Intelligence Pro ject, RIKEN, Osak a, Japan. *Corresp onding author(s). E-mail(s): fujii@i.nago y a-u.ac.jp ; Con tributing authors: yamada.k azuhiro.t5@s.mail.nagoy a-u.ac.jp ; Abstract Sho oting lo cation is a core indicator of oﬀensive style in in v asion sp orts. Exist- ing basketball shot-chart analyses often use spatial information for descriptive visualization, lo cation-based eﬃciency modeling, or clustering play ers in to sho ot- ing archet yp es, yet few studies provide a uniﬁed framework for fair comparison of shot-type-sp eciﬁc tendencies. W e prop ose the shot-type-aw are areal multilev el P oisson (ST AMP) mo del, whic h join tly models team-lev el ﬁeld-goal attempts across predeﬁned court regions, seasons, and shot types using a P oisson likeli- ho od with a p ossession-based exp osure oﬀset. The hierarc hical random-eﬀects structure combines team, area, team-area, and team-side random eﬀects with shot-t yp e-sp eciﬁc random slop es for key shot categories. W e ﬁt the mo del using appro ximate Bay esian inference via the Integrated Nested Laplace Approxima- tion (INLA), enabling eﬃcient analysis of more than 3 × 10 5 shots from tw o seasons of B.LEA GUE (the men’s professional bask etball league in Japan). The ST AMP model ac hieves better out-of-sample predictiv e p erformance than simpler baselines, yielding interpretable relative-rate maps and left-right bias summaries. Case studies illustrate how the mo del reveals team-sp eciﬁc spatial tendencies for comparativ e analysis, and we discuss its limitations and p oten tial extensions. 1 1 In tro duction In inv asion games, sho oting is one of the most crucial actions. Beyond its practical v alue for p erformance ev aluation and scouting, shot data ha ve also b ecome a testb ed for methodological innov ation. This type of work is esp ecially prev alen t in soccer and bask etball; how ever, because their purp oses and tasks diﬀer, the mo deling strategies adopted in each ﬁeld often diverge. In so ccer, the relativ e rarit y of goals, which are the most critical even t, has caused rep eated attention to the v alue of shots. The most w ell-known metric for shots in so ccer is the Exp ected Goal (xG), whic h represen ts the probability that a given shot results in a goal ( Anzer & Bauer , 2021 ; Lucey , Bialko wski, Monfort, Carr, & Matthews , 2015 ; Rathk e , 2017 ; Xu, Bretzner, W ang, & Maki , 2025 ). Using trac king data (time- series data that con tin uously record play er or ball positions/velociti es during a game) and even t data (discrete data that record action types and co ordinates during a game, suc h as passes), situation-dep enden t goal probabilities are mo deled and quan tiﬁed. The models used include logistic regression ( Brechot & Flepp , 2020 ; Eggels, V an Elk, & Pec henizkiy , 2016 ; F airchild, Pelec hrinis, & Kokk o dis , 2018 ; Pardo , 2020 ), gradi- en t bo osting trees suc h as X GBo ost ( Chen & Guestrin , 2016 )( Cavus & Biecek , 2022 ; Hewitt & Karakuş , 2023 ; Mead, O’Hare, & McMenemy , 2023 ; Pardo , 2020 ), and in some cases (Bay esian) hierarchical mo dels ( Scholtes & Karakuş , 2024 ; T ureen & Olthof , 2022 ). By con trast, in basketball, while research attempting to capture shot quality do es exist ( Kambhamettu, Shriv astav a, & Gwilliam , 2024 ; Schmid, Schöpf, & Kolbinger , 2025 ), it is relativ ely scarce, partly because pla yer p osture data, although critical, are not widely a v ailable. Instead, w ork that visualizes or quan tiﬁes sho oting eﬃciency or tendencies is more common ( Y amada & F ujii , 2025 ). Previous studies ( Cao, Y ang, & Hu , 2025 ; Ho , 2025 ; Miller, Bornn, A dams, & Goldsb erry , 2014 ; Reich, Ho dges, Carlin, & Reich , 2006 ; Scrucca & Karlis , 2025 ; W ong-T oi, Y ang, Shen, & Hu , 2023 ; Zuccolotto, Sandri, & Manisera , 2021 , 2023 ) ha ve centered on mo deling spatial information using shot charts, whic h record the lo cations of shots taken. This literature can broadly b e divided into t wo strands: studies fo cusing on sho oting eﬃciency b y lo cation and studies fo cusing on spatial sho oting tendencies. The former mainly addresses the question of where shots should b e taken, with an eye tow ard applications in optimizing shot strategy ( Ehrlich & Sanders , 2024 ; Jiao, Hu, & Y an , 2021 ; Sandholtz, Mortensen, & Bornn , 2020 ). The latter is primarily concerned with understanding play ers’ or teams’ spatial preferences, sometimes as a basis for clustering them into archet yp es. F or this second strand, the modeling approac h most frequen tly applied to shot c harts has b een the Log-Gaussian Cox Pro cess (LGCP; Møller, Syversv een, and W aagep etersen ( 1998 )) ( Cao, Cai, et al. , 2025 ; Hu, Y ang, & Xue , 2021 ; Miller et al. , 2014 ; Sandholtz et al. , 2020 ; Yin, Hu, & Shen , 2023 ). LGCP mo dels the log-intensit y ov er contin uous space as a Gaussian pro cess, t yp- ically estimated v ia a discretized, sparsit y-inducing represen tation. Ho wev er, when marks, that is, categorical lab els asso ciated with each ev ent (e.g., shot type: jump shot, la yup, etc.), are introduced, the contin uous ﬁeld can absorb v ariation that should instead b e attributed to those categories. F or comparativ e inference across pla yers or teams, one natural approach is to introduce random eﬀects indexed b y grouping 2 factors (suc h as team or play er) and to imp ose cen tering or sum-to-zero constraints so that these eﬀects are iden tiﬁable and interpretable as deviations from a common baseline. These additions, how ever, increase coupling among parameters, reduce spar- sit y , and w orsen the conditioning of the Hessian, leading to numerical instabilit y and higher computational cost. In practice, LGCPs can therefore b e cumbersome for fair comparisons b et ween pla yers or teams, esp ecially when mark information is required. A ccordingly , w e in tro duce the shot-type-aw are areal m ultilevel Poisson (ST AMP) mo del, a P oisson generalized linear mixed model (GLMM) for shot counts ov er prede- ﬁned court regions. By incorp orating p ossession counts as an oﬀset term and random eﬀects for team, area, team × area, team × side and shot-t yp e-dep enden t team × area slop es, ST AMP yields exp osure-adjusted comparisons across teams. Mathematically , the sp eciﬁcation is a laten t Gaussian mo del, which enables fast Bay esian inference via the Integrated Nested Laplace Approximation (INLA; Rue, Martino, and Chopin ( 2009 )). This structure allows us to ﬁt league-scale, shot-type-aw are spatial mo dels in min utes; thus, the metho d is practical for routine performance analysis. Similarly , shot counts ov er predeﬁned areas hav e b een mo deled using a Poisson GLMM ( W ong-T oi et al. , 2023 ). Their goal, how ever, is play er categorization while accoun ting for shot success, whereas our fo cus is comparative inference for play ers and teams. In particular, w e emphasize team-level shot c harts as a primary unit of analysis, whic h has b een less frequen tly addressed in prior spatial shot-chart work. F urthermore, b y using p ossession counts as an oﬀset, w e mitigate concerns regarding comparability based on exp osure, a challenge in that study , enabling rate-based comparisons ev en when exposure levels diﬀer. In parallel, marked spatial p oin t process approaches hav e b een proposed ( Jiao et al. , 2021 ; Y eung, Sit, & F ujii , 2025 ). In contrast, our model is more interpretable, as it adopts a m ultiplicatively separable, log-linear decomp osition on the rate scale, and it is more scalable b ecause inference relies on sparse GLM- M/INLA machinery rather than Marko v Chain Monte Carlo (MCMC) or deep neural net work-based models. Our contributions are threefold: • Mo del: reusable template for marked spatial count data. W e form ulate the ST AMP mo del as a latent Gaussian Poisson GLMM for spatially structured coun t data with multiplicativ ely separable, log-linear rate comp onen ts indexed by team, season, area, side, and shot type. Shot type plays the role of a categorical mark, with mark-speciﬁc random slop es on team-b y-area eﬀects; all random eﬀects (team, area, team × area, team × side, and mark-sp eciﬁc slop es) are mean-centered and constrained to sum to zero, ensuring iden tiﬁability and in terpretable log-rate con trasts. While our empirical study focuses on bask etball, the same template can b e applied to other inv asion sp orts or spatially aggregated ev ent pro cesses deﬁned o ver pre-speciﬁed regions and categorical marks. • Inference & scalability: fast and stable. The ST AMP mo del remains within the laten t Gaussian class and is ﬁtted via INLA with Penalized Complexity (PC) priors on v ariance and correlation h yp erparameters ( Simpson, Rue, Riebler, Martins, & Sørb ye , 2017 ), a voiding MCMC and heavy deep arc hitectures while still providing stable approximate Bay esian inference for league-scale data (on the order of 3 × 10 5 shots) in roughly a min ute p er mo del on a standard CPU. 3 • Utilit y: league-wide in terpretabilit y and fair comparisons. The mo del pro vides interpretable log-rate contrasts, supp orts exp osure-fair team-to-team com- parisons by incorp orating p ossession counts as an oﬀset. In the exp erimen ts, we demonstrate the predictiv e v alidity and practical application in the B.LEAGUE (the men’s professional basketball league in Japan). In brief, the ST AMP mo del is a Poisson GLMM for areal shot coun ts with a p os- session oﬀset and ﬁxed eﬀects for shot-type, season, and court side. It includes four k ey random-eﬀects comp onen ts, tw o interaction terms (team × area and team × side) and tw o shot-t yp e-speciﬁc team × area slop es, that capture team- and location-sp eciﬁc deviations in shot tendencies within a latent Gaussian INLA framework with PC pri- ors on v ariance and correlation hyperparameters. Using only priors that pass prior predictiv e chec ks, we then systematically ev aluate out-of-sample predictiv e perfor- mance under alternative random-eﬀects sp eciﬁcations obtained b y switching on and oﬀ these four comp onents. F ull details of the mo del sp eciﬁcation, prior choices, and exp erimen tal design are given in Metho ds 4 . Guided b y this speciﬁcation, our empiri- cal analysis is designed to address three questions. First, we quan tify how m uch eac h of the four random-eﬀects comp onen ts contributes to out-of-sample predictive p er- formance by comparing the sixteen candidate structures obtained by switc hing them on and oﬀ (Section 2.2 ). Second, we inv estigate what spatial and left-right patterns the fully sp eciﬁed ST AMP mo del reveals at the league and team levels, fo cusing on represen tative B.LEA GUE teams (Section 2.3 ). Third, we examine whether adding further team-level slop es for shot-type preferences yields meaningful gains in predictiv e p erformance beyond the core ST AMP structure (Section 2.4 ). 2 Results 2.1 Dataset In this study , we use play-b y-play data for a total of 2,352 games from the 2023–24 and 2024–25 B.LEA GUE seasons. Play-b y-play data are time-stamp ed logs that record on- court actions such as shots and reb ounds, as well as game even ts such as substitutions. Note that B.LEAGUE play-b y-play data is av ailable for viewing on the oﬃcial website for each game, but it is not provided as op en data in a machine-readable format for bulk acquisition for researc h purp oses. This study utilized play-b y-play data with shot lo cation provided for research purp oses by Data Stadium Inc., the oﬃcial data supplier for B.LEAGUE. In the B.LEAGUE, each team plays against an opp onen t in rounds consisting of one or tw o consecutive games; during the regular-season, B1 teams play 60 games ov er 36 rounds, and B2 teams play 60 games ov er 32 rounds. Afterwards, the top eight teams in each division adv ance to a p ost-season tournament pla y ed in a best-of-three (t wo-win) format. F or modeling, w e use only the num b er of ﬁeld-goal attempts in eac h court area for eac h shot-type, after standardizing the oﬀensiv e direction, together with the num b er of p ossessions for each team. The shot-t yp e is determined from the action type ﬁeld in the pla y-by-pla y data, and the court area divisions follo w the deﬁnitions commonly used 4 Fig. 1 Half-court area divisions; 1: under basket, 2: in the paint, 3: inside right wing, 4: inside right, 5: inside center, 6: inside left, 7: inside left wing, 8: outside right wing, 9: outside right, 10: outside center, 11: outside left, 12: out side leftwing, 13: back court. in the B.LEAGUE (Figure 1 ). How ever, left and righ t symmetric areas are merged in to a single area (for example, outside left and outside right are merged into outside ). W e also merge several similar shot-type lab els to ensure suﬃcient counts in each category: fadea wa y and turnaround jump shot; la yup and driving la yup; dunk, tip-in, alley- o op, and ho ok shot; and step-bac k jump shot and pull-up jump shot, for a total of 6 categories. The dataset con tains 306,444 ﬁeld-goal attempts in total, of which jump shots (denoted jump_shot ) account for 156,423 attempts, step-back / pull-up jump shots (denoted step_pull ) accoun t for 50,454 attempts, layup group (denoted lay_up ) accoun t for 48,550 attempts, ﬂoating jump shots (denoted floater ) account for 19,000 attempts, dunk / tip-in / alley-o op / ho ok shots (denoted rim_finishes ) acoun t for 17,819 attempts, and fadeaw ay / turnaround jump shots (denoted fade_turn ) accoun t for 14,198 attempts. The mean num b er of ﬁeld-goal attempts p er team p er season is 4,032.16, with a standard deviation of 251.80. Note that each court area is also assigned to a coarse side category (Left, Cen ter, Righ t) based on Figure 1 : zones 6, 7, 11, and 12 are treated as Left; zones 1, 2, 5, 10, and 13 as Cen ter; and zones 3, 4, 8, and 9 as Right. 2.2 Predictiv e p erformance of ST AMP mo del W e ﬁrst compare predictive p erformance across the sixteen structures obtained by switc hing on or oﬀ each group of random-eﬀect components: team × area eﬀects ( use_ta ), team × side eﬀects ( use_ts ), shot-type-sp eciﬁc random slop es for jump shots ( use_jump ), and shot-type-sp eciﬁc random slop es for step-back/pull-up shots ( use_step ). All candidates share the same ﬁxed-eﬀects structure; only the presence or absence of these random-eﬀect blo c ks diﬀers (see Metho ds 4 for details). F or each random-eﬀects structure, w e ﬁt the ST AMP mo del to regular-season data and ev aluate out-of-sample predictiv e performance on the p ost-season data. Our 5 T able 1 Mo del comparison for core structures. The columns use_ta , use_ts , use_jump , and use_step indicate whether each comp onen t is included ( ✓ ) or excluded (–). The primary criterion is the Monte Carlo estimate of the out-of-sample exp ected log predictive density d elpd MC post , with LPML regular reported as a secondary , ﬁtting-data criterion. Larger v alues (less negative) indicate b etter predictive performance. All v alues are computed under the prior setting U sd = 1 . 5 and U slope = 1 . 0 describ ed in Section 4.2 . Boldface en tries denote the b est v alue among all 16 candidate structures, including those not sho wn in this table. use_ta use_ts use_jump use_step d elp d MC post ↑ LPML reg ↑ – – – – − 6 , 581 − 130 , 617 ✓ – – – − 6 , 422 − 115 , 261 – – ✓ ✓ − 3 , 499 − 50 , 141 ✓ – ✓ ✓ − 3 , 385 − 37 , 835 ✓ ✓ ✓ ✓ − 3 , 383 − 49 , 967 primary criterion is the Monte Carlo approximation d elp d MC post of the exp ected log pre- dictiv e density on p ost-season p ossessions, computed by drawing p osterior samples of the latent Gaussian ﬁeld and av eraging the log-likelihoo d con tributions cell-wise. As a complemen tary ﬁtting-data measure, we also report the log pseu do-marginal likeli- ho od on the regular-season data, LPML reg . T able 1 summarizes d elp d MC post and LPML reg for selected core structures under a representativ e prior conﬁguration. The simplest baseline that omits all random eﬀects ( use_ta = 0 , use_ts = 0 , use_jump = 0 , use_step = 0 ) exhibits the worst predictive p erformance for post-season data. Adding only team × area or team × side in teractions yields mo dest impro vemen ts, while the largest gains arise when both interactions and shot-type-sp eciﬁc slop es are included. The full mo del with all four comp onen ts switched on ( use_ta = 1 , use_ts = 1 , use_jump = 1 , use_step = 1 ) ac hieves the b est predictive score for p ost-season, outp er- forming the baseline by roughly ∆ d elp d MC post ≈ 3 . 2 × 10 3 while also attaining the highest LPML reg among the structures with use_ta = 1 , use_jump = 1 , and use_step = 1 . How- ev er, the gain from including the team × side eﬀects is very small in out-of-sample terms: the mo del with use_ta = 1 , use_ts = 0 , use_jump = 1 , use_step = 1 attains an d elp d MC post only tw o units w orse than the full mo del, while ac hieving a substantially b et- ter LPML reg . This suggests that most of the predictiv e gain comes from the team × area in teractions and jump/step slopes, and that team × side eﬀects pro vide, at best, a marginal reﬁnement. Despite its ric her random-eﬀect structure, the full mo del remains computationally feasible: a single INLA ﬁt requires on the order of one min ute of w all-clo c k time on our hardware (see Section 4.3 ). T o assess the sensitivit y of this comparison to the prior h yp erparameters, we rep eated the sixteen-model comparison across all prior conﬁgurations that passed the prior predictiv e chec ks in Section 4.2 . F or each random-eﬀects structure, we a veraged d elp d MC post and LPML reg o ver these prior settings; the full mo del remained the b est p er- former in terms of mean d elp d MC post , and the improv ement ov er the baseline stay ed on 6 Fig. 2 Left-righ t asymmetry caterpillar plot. The horizontal axis sho ws the logarithm of the ratio of relative occurrence rates b etw een right and left (aggregated for each team using the geometric mean ov er the season; v alues closer to the left indicate a bias tow ard the left side). The dashed line indicates no left-right bias, the points represent the estimated v alues, and the horizontal bars represen t the approximate 95% conﬁdence interv al. the order of 3 . 2 × 10 3 . Moreov er, for every prior conﬁguration, the highest d elp d MC post w as alwa ys attained by one of the t wo richest structures, use_ta = 1 , use_jump = 1 , use_step = 1 with use_ts either 0 or 1 ; no sparser sp eciﬁcation ever ranked ﬁrst. The diﬀerence betw een these tw o top mo dels was alwa ys small (on the order of a few units in d elp d MC post ), reinforcing the view that team × side eﬀects play a secondary role compared with team × area interactions and shot-type-sp eciﬁc slop es. The ranking of the sixteen structures was also highly stable across prior conﬁgurations: Kendall’s τ rank correlations b et ween the model rankings were ab o ve 0.97. T aken together, these results indicate that the qualitative adv antage of including team × area interactions and shot-t yp e-speciﬁc random slopes (with or without team × side eﬀects) is robust to reasonable changes in the prior sp eciﬁcation. 2.3 Spatial patterns and interpretation T o illustrate how the ST AMP model with all random eﬀects can b e used to interpret spatial tendencies in practice, we ﬁrst summarize league-wide left-righ t biases and then present detailed examples for tw o representativ e B.LEAGUE teams, Utsunomiya Brex and Nagoy a Diamond Dolphins. Figure 2 shows a caterpillar plot of the team-by-side eﬀects z ids , summarized as the log ratio log(Right / Left) of the p osterior multipliers for the right and left sides, aggregated ov er seasons via a geometric mean. V alues greater than zero indicate a preference for shots on the righ t side, whereas negativ e v alues indicate a left-side 7 Fig. 3 Area-sp eciﬁc relativ e app earance trends for the Utsunomiya Brex in the 2024-25 season (Left: jump_shot , Right: step_pull ); side scaling applied. Colors and lab els represent team-level p ercen tiles (pXX), with larger v alues and brighter colors indicating relatively more frequent o ccurrences. Fig. 4 Area-sp eciﬁc relative app earance trends for the Nagoy a Diamond Dolphins in the tw o seasons (left: 2023–24 season, right: 2024–25 season); side scaling applied. Colors and lab els represent team- level percentiles (pXX), with larger v alues and brighter colors indicating relatively more frequent occurrences. preference; horizontal bars indicate approximate 95% credible interv als. Most teams exhibit mo derate left-right asymmetries whose interv als o verlap the symmetric case log(Right / Left) = 0 , but a few teams sho w clearly identiﬁable side-to-side biases. This type of summary is directly in terpretable for coaches and analysts: it highlights whether a team tends to fav or one side of the court ov er the other, after adjusting for o verall shot volume. W e then fo cus on tw o B.LEA GUE teams, Utsunomiy a Brex and Nagoy a Diamond Dolphins, to provide concrete examples of the spatial patterns captured by the ST AMP mo del. F or eac h team, we combine the estimated shot-type-sp eciﬁc random slopes with the team × side eﬀects and conv ert the resulting p osterior mean rates into p er- cen tiles relative to the league-wide distribution for each area. Intuitiv ely , these maps summarize where a team sho ots more or less frequently than the league baseline, after accoun ting for o verall shooting v olume. 8 Figure 3 sho ws side-adjusted shot charts for Utsunomiy a Brex in the 2024-25 sea- son, for jump_shot (left panel) and step_pull (right panel). Each tile corresp onds to one of the predeﬁned court regions, and its color enco des the team-lev el p ercen tile of the p osterior rate among all teams in the same season (brighter colors indicate higher p ercen tiles), while o verlaid lab els (pXX) rep ort the corresp onding p ercen tile in n umerical form. By construction, the ﬁgure highlights hot and cold regions in terms of relativ e attempt frequency , rather than raw shot counts, making it easy to iden tify where Utsunomiya is particularly active or inactive compared with league-wide ten- dencies. The c hart indicates that Utsunomiya is not only a team that attempts many three-p oin t shots, but also shows a relativ ely strong preference for taking ab o ve the break three-p oin ters when sho oting oﬀ the dribble. This qualitative agreemen t with Utsunomiy a’s reputation as a three-p oin t-oriented team ( Onuma , 2025 ) oﬀers a use- ful sanit y chec k on the ST AMP estimates, suggesting that the mo del captures salien t asp ects of their oﬀensive proﬁle. T o illustrate temp oral comparisons, Figure 4 presents side-adjusted step_pull c harts for Nagoy a Diamond Dolphins in the 2023-24 and 2024-25 seasons, display ed side b y side. Because the color scale and p ercentile deﬁnition are shared across panels, c hanges in shading and lab els directly reﬂect shifts in the team’s relativ e use of each region from season to season. In this wa y , ST AMP-based maps provide an in terpretable league-normalized summary of ho w spatial tendencies of a team remain stable or ev olve. 2.4 Extended team-lev el slop es T o examine whether further team-level heterogeneity in shot-t yp e preferences improv es predictiv e p erformance, w e conducted an additional exp erimen t that augments the ST AMP mo del with extra team-level slopes. Fixing use_ta = 1 , use_jump = 1 , and use_step = 1 , we considered all 2 × 2 4 = 32 model conﬁgurations obtained by toggling the inclusion of team-by-side eﬀects ( use_ts ∈ { 0 , 1 } ) and four additional team-level, season-sp eciﬁc slop es for the remaining shot-type categories, which hav e small spa- tial v ariation: lay_up , floater , rim_finishes , and fade_turn (indicators use_lay , use_float , use_rim , use_fade ). All models w ere ﬁtted under the same PC prior conﬁguration as in the main comparison ( U sd = 1 . 5 , U slope = 1 . 0 ; Section 4.2 ) and ev aluated using d elp d MC post and LPML reg . Selected results are summarized in T able 2 , whic h lists the top 10 extended-slop e conﬁgurations ranked b y d elp d MC post . As sho wn in T able 2 , some highly ﬂexible sp eciﬁcations that turn on most of the additional slopes achiev e slightly larger d elp d MC post than the original full ST AMP mo del, but at the cost of extremely po or LPML reg (e.g., on the order of − 5 × 10 5 ), suggest- ing o verﬁtting and unstable ﬁt to the regular-season data. F o cusing on conﬁgurations that simultaneously p erform well on b oth criteria, the b est trade-oﬀ w as obtained b y the mo del w ith use_ta = 1 , use_ts = 0 , use_jump = 1 , use_step = 1 , and additional team-level slop es only for floater and fade_turn ( use_lay = use_rim =0, use_float = use_fade =1; T able 2 ). This conﬁguration improv ed the p ostseason cri- terion to d elp d MC post ≈ − 3 . 35 × 10 3 , ab out +3 . 0 × 10 1 relativ e to the original full mo del 9 T able 2 T op 10 extended team-level slop e conﬁgurations, ranked by d elpd MC post (larger values indicate b etter out-of-sample predictive p erformance). All mo dels include team × area interactions and jump/step team × area × season slop es ( use_ta = ✓ , use_jump = ✓ , use_step = ✓ ); the table rep orts which additional components are switched on ( ✓ ) or oﬀ (–). The prior scales are ﬁxed at U sd = 1 . 5 and U slope = 1 . 0 . Boldface entries denote the b est v alue among all 32 candidate structures, including those not shown in this table. use_ts use_lay use_float use_rim use_fade d elp d MC post ↑ LPML reg ↑ ✓ ✓ ✓ – ✓ − 3 , 284 − 510 , 493 ✓ ✓ – ✓ ✓ − 3 , 308 − 355 , 010 ✓ ✓ – – ✓ − 3 , 341 − 73 , 155 – ✓ ✓ – ✓ − 3 , 347 − 37 , 184 – ✓ – – ✓ − 3 , 347 − 37 , 404 ✓ ✓ ✓ ✓ – − 3 , 347 − 50 , 299 – ✓ ✓ ✓ ✓ − 3 , 348 − 37 , 297 – – ✓ ✓ ✓ − 3 , 349 − 37 , 232 – ✓ ✓ ✓ – − 3 , 349 − 37 , 453 – – ✓ – ✓ − 3 , 352 − 36 , 806 ( use_ta = use_ts = use_jump = use_step = 1 ), and sim ultaneously increased LPML reg b y roughly 1 . 0 × 10 3 . Ov erall, these results indicate that allo wing additional team-level slop es for a subset of shot t yp es can yield incremental gains, but the marginal improv ement in predictiv e p erformance ov er the original ST AMP sp eciﬁcation is mo dest compared to the gains already obtained from including team × area interactions and shot-type- sp eciﬁc jump/step slop es. F or parsimon y and in terpretability , w e therefore adopt the full ST AMP model without the extra team-level slop es on lay_up , floater , rim_finishes , and fade_turn as our primary sp eciﬁcation, and regard this extended- slop e analysis as a robustness chec k. In applications where the main ob jectiv e is to obtain a concise, stable description of team-level spatial tendencies, such additional slop es may not justify the extra mo del complexity , whereas they can b e selectively in tro duced when one wishes to sharp en relative comparisons for a particular shot-type category of interest. 3 Discussion 3.1 Summary and practical implications The ST AMP model extends conv entional shot-c hart analyses b y join tly mo delling team-lev el ﬁeld-goal attempts across predeﬁned court regions, seasons, and shot types, while remaining computationally feasible for league-scale datasets. The resulting relativ e-rate maps and left–right bias summaries can pro vide coac hes and analysts with interpretable diagnostics of where and how a team tends to generate shots rela- tiv e to the rest of the league, as illustrated by the examples for Utsunomiya Brex and Nago ya Diamond Dolphins (Section 2.3 ). Bey ond the case studies in this pap er, w e believe that the ST AMP mo del has the p oten tial to serve as a routine monitoring and scouting to ol. Because the mo del only 10 requires play-b y-play-lev el shot records and p ossession coun ts, it can b e reﬁt as new games accumulate and embedded into existing analytics dash b oards without additional trac king infrastructure. The p er-possession rate formulation facilitates comparisons fair across teams with diﬀeren t paces, while the p ercen tile-based maps and left-righ t bias summaries can b e ov erlaid on conv entional eﬃciency statistics (e.g., ﬁeld-goal p ercen tage by region) to jointly assess “ho w often” and “ho w well” teams sho ot from diﬀeren t lo cations. In practice, such a to ol could help coaching staﬀ quic kly iden tify st ylistic tendencies and p oten tial blind sp ots. 3.2 Metho dological implica tions and robustness Metho dologically , the ST AMP mo del is sp eciﬁed as a Poisson linear mixed mo del with Gaussian random eﬀects, whic h places it squarely within the class of latent Gaussian mo dels targeted b y INLA. W orking with predeﬁned court regions further helps keep the dimensionalit y of the latent ﬁeld manageable and the resulting components inter- pretable, enabling ﬁtting and inference at league scale without resorting to M CMC or hea vy architectures. Our systematic comparison of structures and prior conﬁgurations suggests that the main substantiv e gains in predictiv e performance arise from including team × area eﬀects together with shot-type-sp eciﬁc slop es for jump shots and step-bac k/pull-up jump shots (Section 2.2 ). This conclusion was robust across all PC prior scale c hoices that passed the prior predictive c hecks, with the richest mo dels consistently ranking at or near the top. The extended exp erimen t with additional team-level slop es for lay_up , floater , rim_finishes , and fade_turn show ed only modest incremen tal impro vemen ts in d elp d MC post relativ e to the original ST AMP sp eciﬁcation. These ﬁndings suggest that, unless a sp eciﬁc application calls for detailed re lativ e comparisons within those particular shot-t yp e categories, the simpler slop e structure app ears to retain most of the predictive and explanatory p o wer while keeping the mo del more parsi- monious. In a similar spirit, side eﬀects are handled in a delib erately simple wa y in the presen t sp eciﬁcation, through additive team × side in teractions only . If an analysis required more detailed side-sp eciﬁc b eha viour, the same framew ork could b e extended to allo w shot-t yp e-dep enden t team × side slop es or other side-v arying comp onen ts, at the cost of additional complexit y and data requiremen ts. F rom a modeling persp ectiv e, the ST AMP model can b e viewed as a prag- matic compromise betw een fully contin uous (marked) p oint-process form ulations (e.g., LGCP) and purely descriptive shot-chart summaries. By aggregating shots to pre- deﬁned court regions and mo delling region-level attempt rates with a log-linear hierarc hical structure, the ST AMP mo del no longer tracks the exact ( x, y ) lo cation of eac h shot. Consequently , it cannot represent very ﬁne-grained within-region pref- erences (for example, a tendency to fav or certain sp ots in the paint). In return, the mo del w orks with a muc h low er-dimensional laten t ﬁeld, enjoys far more replication p er parameter, and can b e easier to ﬁt and interpret at the league scale. Com bined with the partial po oling structure across teams and seasons, this often makes the ST AMP mo del more data-eﬃcien t than con tinuous-space LGCP approaches when applied at the league scale. 11 The choice of predeﬁned regions is also motiv ated by practice. W e adopt a prede- ﬁned partition of the half-court that closely matches the zone la yout used in oﬃcial B.League shot charts, so that the regions remain coarse enough to hav e a clear tac- tical interpretation while still b eing ﬁne enough to reveal substantial diﬀerences in shot selection. In applied analysis and scouting, shot charts are often summarized at this regional resolution, so w orking with these predeﬁned areas preserv es practical in terpretability while enabling eﬃcient partial p ooling. 3.3 Limitations and future directions This study has sev eral limitations regarding scop e and generalizabilit y . All analyses are based on t wo seasons of B.LEAGUE play and are carried out at the team level within a single men’s league. W e do not examine longer historical windows, women’s com- p etitions, lo wer divisions, or multi-league settings, nor do we ﬁt play er-level mo dels. Consequen tly , it remains an op en question how well the present ST AMP sp eciﬁca- tion and its ﬁtted spatial patterns w ould transfer to other contexts, and applying the mo del elsewhere would likely require recalibrating priors and p ossibly adapting random eﬀects. Second, the current work fo cuses exclusively on shot frequency rather than shot outcomes or eﬃciency . W e do not mo del made/missed outcomes, eﬀectiv e ﬁeld-goal p ercen tage, or exp ected points, so the ST AMP mo del as sp eciﬁed here is best viewed as a to ol for comparing where teams tend to generate shots, not ho w eﬃcien t they are from those lo cations. Nev ertheless, the same hierarc hical framework can b e extended to join tly mo del shot volume and outcomes. A natural next step would b e to com bine the current P oisson model for attempt coun ts with an additional binomial (or logisti c) submodel for made/missed outcomes, thereb y deﬁning b oth an exp ected attempt rate (xR) and exp ected p oin ts (xP) at the team-region-shot-t yp e lev el. Suc h an extension would retain the interpretabilit y of the present ST AMP mo del sp eciﬁcation, while oﬀering a uniﬁed view of spatial shooting tendencies and scoring eﬃciency . Third, by construction, the ST AMP model is an areal model: shots are aggregated to predeﬁned court regions and summarized via a log-linear P oisson hierarch y . As a result, the mo del cannot resolve ﬁne-grained v ariation in contin uous shot lo cations within a region, nor do es it explicitly accoun t for defender p ositions, play t yp e, or other con textual factors. Such information is av eraged within regions or indirectly absorb ed in to the random eﬀects, which is appropriate for team-lev el tendency comparisons, but ma y b e limiting for more detailed tactical analysis. If richer mark information, such as pla y-type lab els (e.g., those provided b y commercial tagging services), is av ailable, the same areal framework could b e extended to incorp orate these marks as additional ﬁxed or random eﬀects, or ev en as alternative resp onses, thereb y enabling complementary mo dels that fo cus on oﬀensiv e style or play-con text patterns. An additional mo deling choice concerns the shot-type taxonomy used in the presen t analysis. F or mo deling conv enience and to secure suﬃcient counts p er cell, w e man ually merged several pla y-by-pla y action lab els into six broader shot-type cate- gories (e.g., grouping layups and driving layups, or fadeaw ay and turnaround jump shots). Although these groupings are defensible from a basketball p erspective, they 12 are inevitably somewhat ad ho c. If richer contextual information, such as sho oting p osture, ball tra jectory , or play er tracking data, were a v ailable, the indicator functions that gate the shot-t yp e-speciﬁc random slop es could, in principle, b e replaced b y fea- tures or laten t represen tations learned from those data. This w ould open the do or to more data-driven and potentially more nuanced shot-t yp e partitions within the same hierarc hical framework, while preserving the o verall structure of ST AMP . Bey ond team-lev el applications, extending the ST AMP mo del to play er-level com- parisons would b e a natural next step. A straightforw ard approach w ould b e to embed additional random eﬀects for pla yers nested within teams, allowing the same regional and shot-type structure to describ e within-team v ariation. F or cross-team compar- isons, ho wev er, it may often b e more meaningful to compare play ers who o ccup y similar tactical roles (e.g., ball handlers, sho oters, bigs). In such settings, one could either ﬁt separate models by role or include role indicators and role-speciﬁc random eﬀects so that diﬀerences in spatial tendencies are interpreted relative to pla yers with com- parable resp onsibilities and usage, pro vided that each play er accumulates suﬃcient shot volume. Practically , adding such play er- and role-lev el components also increases the num b er of h yp erparameters and the size of the laten t ﬁeld, which in turn raises the computational cost of INLA. If play er comparison is the only ob jectiv e, a simpler pla yer-lev el shot-chart mo del may therefore be a more computationally con v enient c hoice. More broadly , the ST AMP framework is p oten tially applicable to other inv asion sp orts in which shot actions are spatially structured, such as handball or ice ho c key . In con trast, for sp orts like so ccer, where shots are relatively rare and muc h of the tactical ric hness lies in the build-up pla y , it may b e more practically useful to adapt the same areal Poisson idea to mo del passes, entries in to dangerous zones, or other in termediate actions using an appropriate exposure oﬀset rather than shots themselv es. In this sense, the ST AMP mo del can b e regarded as a general template for spatially structured coun t data that can b e repurp osed according to the most informative action t yp e in eac h sp ort. 4 Metho ds 4.1 Prop osed mo del: shot-type-aw are areal m ultilev el P oisson (ST AMP) mo del In this study , for team i = 1 , . . . , I , season s = 1 , . . . , S , area a = 1 , . . . , A , side d ∈ { Left , Right , Center } , and shot-type k ∈ K , we mo del the num b er of ﬁeld goal attempts in the cell ( i, s, a, d, k ) , denoted by Y isadk , using the team × season p ossession coun t E is as an exp osure oﬀset. Throughout, we adopt a shot-terminated deﬁnition of possession: a p ossession ends at each ﬁeld goal attempt, and an oﬀensiv e reb ound b egins a new p ossession. 13 4.1.1 Lik eliho o d and linear predictor The likelihoo d and linear predictor are giv en by Y isadk ∼ Poisson( µ isadk ) , log µ isadk = log E is + η isadk , (1) where η isadk denotes the linear predictor. W e decomp ose η isadk as η isadk = β 0 + β (season) s + β (side) d + β (shot _ type) k | {z } ﬁxed eﬀects + u i |{z} team (iid) + v a |{z} area (iid) + w ias | {z} team × area (seasonally correlated) + z ids |{z} team × side (season-sp eciﬁc) + M X m =1 γ m ( k ) r ( m ) ias | {z} shot-type-speciﬁc team × area (seasonally correlated) , (2) sub ject to the sum-to-zero constraints I X i =1 u i = 0 , (3) A X a =1 v a = 0 , (4) I X i =1 A X a =1 S X s =1 w ias = 0 , (5) X d ∈{ Left , Right , Cen ter } z ids = 0 ( ∀ i, s ) , (6) I X i =1 A X a =1 S X s =1 r ( m ) ias = 0 ( ∀ m ) , (7) where, E is is the num b er of p ossessions for team i in season s (oﬀset term); u i is an i.i.d. random eﬀect for team i ; and v a is an i.i.d. random eﬀect for area a . The terms w ias are seasonally correlated team × area eﬀects; for eac h ( i, a ) we assume an equicorrelation structure corr( w ias , w ias ′ ) = ρ ta for s  = s ′ . F or team × side, we use season-sp eciﬁc i.i.d. random eﬀects z ids with zero-sum constrain ts P d z ids = 0 for eac h ( i, s ) , but do not imp ose an explicit temp oral correlation across seasons . Since side is a coarse factor (Left/Righ t/Center) and the corresp onding coun ts are relatively abundan t, we mainly in tro duce seasonal correlation for the more gran ular and data-sparse team × area and shot-t yp e-sp eciﬁc eﬀects. 14 The collection { r ( m ) ias } M m =1 denotes shot-type-dep endent random slop es for team × area; for eac h m and ( i, a ) these are also mo deled as seasonally correlated across s , analogously to w ias . The function γ m ( k ) is a known co ding function for shot-type k . As the simplest example, using the indicator function based on the family of shot-type sets {K m } M m =1 , γ m ( k ) = 1 { k ∈ K m } , (8) ( 2 ) can b e written as η isadk = · · · + M X m =1 1 { k ∈ K m } r ( m ) ias , (9) and this is the sp eciﬁcation adopted in our exp eriments. Note that each ﬁxed eﬀect is treated as a deviation from a reference category ( N (0 , 100 2 ) is used as a nearly uninformed prior). Because including a uniform interaction for all shot-types would cause ov er- parameterization and destabilize cells ( i, s, a, d, k ) with few shots, w e assign random slop es only to a limited set of shot-types and treat the remaining shot-type eﬀects as ﬁxed eﬀects. This design keeps the mo del additiv e on the log scale and stabilizes computation; together with sum-to-zero cons train ts that ﬁx the ov erall baseline and regularizing h yp erpriors (described below) that shrink the v ariances and temp oral correlations of the seasonally correlated eﬀects tow ards their base mo dels, it yields an appropriate degree of partial p ooling. F urthermore, the equicorrelation structure across seasons for team × area and shot-type-sp eciﬁc eﬀects allows us to capture b oth p ersistence and year-to-y ear v ariation, while the season-sp eciﬁc team × side eﬀects ﬂex- ibly absorb coarse directional biases within eac h season. Note that imposing, for eac h season and area, a sum-to-zero constraint across teams on the season-correlated eﬀects could yield more discriminativ e (i.e., more relative) team diﬀerences, but we did not pursue this due to implemen tation diﬃculties. F rom a lik eliho od p erspective, we retain a P oisson sp eciﬁcation with a possession- based oﬀset rather than switching to a negative-binomial lik eliho od with an additional disp ersion parameter. In our setting, the dominan t sources of ov erdisp ersion are het- erogeneit y across teams, regions, and shot types, which are already mo deled through the random eﬀects; adding a separate disp ersion parameter at the lik eliho o d level w ould eﬀectively la yer extra o verdispersion on top of these comp onen ts, complicat- ing in terpretation and iden tiﬁability . Prior predictiv e chec ks (Section 4.2 ) did not indicate substan tial lack of ﬁt at the aggregated cell level, supp orting the adequacy of the P oisson form ulation. F or similar reasons, we do not imp ose additional spa- tial smo othing priors (e.g., conditional autoregressive structure) across neighboring regions. Instead, predeﬁned court regions are treated as discrete areal categories with indep enden t random eﬀects (i.e., we do not imp ose any spatial smo othing across neigh- b oring regions): this preserv es sharp team-speciﬁc hot and cold sp ots, a voids blurring practically meaningful contrasts b et w een regions, and keeps the laten t Gaussian ﬁeld suﬃcien tly low-dimensional for eﬃcient INLA-based inference. 15 4.1.2 Priors for random eﬀects and hyperparameters The random eﬀects are modeled as u i iid ∼ N (0 , σ 2 team ) , (10) v a iid ∼ N (0 , σ 2 area ) , (11) w ia := ( w ia 1 , . . . , w iaS ) ⊤ ∼ N  0 , σ 2 ta R S ( ρ ta )  , (12) z ids iid ∼ N (0 , σ 2 ts ) , (13) r ( m ) ia :=  r ( m ) ia 1 , . . . , r ( m ) iaS  ⊤ ∼ N  0 , σ 2 m R S ( ρ m )  , m = 1 , . . . , M , (14) where R S ( ρ ) denotes the S × S equicorrelation matrix with ones on the diagonal and oﬀ-diagonal elements equal to ρ . F or the v ariance and correlation h yp erparameters, we use PC priors. In particular, for given thresholds U • , V • and tail probabilities α prec , α cor , we specify Pr( σ • > U • ) = α prec , Pr( | ρ • | > V • ) = α cor , (15) so that large standard deviations and correlations close to ± 1 are a priori p enalized, while mo derate v alues remain plausible. In all analyses we ﬁx α prec = 0 . 05 , so that there is only 5% prior probability that an y standard deviation exceeds its scale param- eter U • . F or the season-correlation parameters ρ ta and ρ m w e use PC-cor priors with V • = 0 . 7 and α cor = 0 . 7 , that is, Pr  | ρ • | > 0 . 7  = 0 . 7 , (16) whic h enco des a prior preference for mo derate-to-strong correlations while still shrinking tow ards ρ • = 0 . The detailed calibration of other thresholds using prior predictive c hec ks is describ ed in Section 4.2 . 4.2 Prior sp eciﬁcation and prior predictive c hecks As describ ed ab o ve, we place PC priors on the standard deviations and seasonal corre- lation parameters of the random eﬀects. In practice, we tune the scale parameters U • that control the t ypical size of the standard deviations, and we divide them into tw o groups: a common scale U sd for the main random eﬀects (team, area, and team × area) and a separate scale U slope for the shot-type-sp eciﬁc random slop es r ( m ) ias . W e follow a Bay esian workﬂo w that emphasizes prior predictive chec ks to cali- brate these hyperparameters and detect implausible mo del b eha vior ( Gabry , Simpson, V ehtari, Betancourt, & Gelman , 2019 ). Concretely , for eac h candidate’s c hoice of PC prior scales ( U sd , U slope ) , w e sim ulate from the prior predictive distribution of the ST AMP mo del, and compare summaries of the simulated shot charts to simple empirical b enc hmarks derived from the B.LEAGUE data, as describ ed below. In particular, to assess whether a given pair ( U sd , U slope ) pro duces realistic prior predictiv e b eha vior, we consider the summary statistics of the replicated data. Let 16 T cell , 95 denote the 95 th p ercen tile of the empirical cell-level rates Y isadk /E is across all cells c = ( i, s, a, d, k ) , and let T tot , 95 denote the 95 th p ercen tile of the team-by- season total coun ts P a,d,k Y isadk o ver ( i, s ) . F or each conﬁguration of ( U sd , U slope ) w e generate 800 prior predictive replicates and compute tw o-sided p -v alues p cell , 95 = 2 min  Pr( T rep cell , 95 ≤ T obs cell , 95 ) , Pr( T rep cell , 95 ≥ T obs cell , 95 )  , (17) and analogously p tot , 95 for T tot , 95 . The probabilities are approximated by empirical prop ortions ov er the 800 replicates. F ollowing a conv en tional prior predictive c heck, w e regard a conﬁguration as acceptable if b oth p cell , 95 and p tot , 95 exceed a nominal threshold (here 0 . 05 ). W e p erformed a grid searc h ov er U sd ∈ { 0 . 5 , 0 . 8 , 1 . 0 , 1 . 2 , 1 . 5 } , U slope ∈ { 0 . 5 , 0 . 8 , 1 . 0 , 1 . 2 , 1 . 5 } , (18) resulting in 25 candidate conﬁgurations. Among these, 7 pairs satisﬁed the prior predictiv e criterion ( p cell , 95 > 0 . 05 and p tot , 95 > 0 . 05 ): ( U sd , U slope ) ∈ { (1 . 5 , 1 . 5) , (1 . 5 , 1 . 2) , (1 . 5 , 1 . 0) , (1 . 5 , 0 . 8) , (1 . 5 , 0 . 5) , (1 . 2 , 1 . 2) , (1 . 2 , 0 . 5) } . (19) W e regard these seven settings as admissible priors and use all of them in a subse- quen t prior sensitivit y analysis. F or brevit y , we rep ort model-comparison results for a represen tative conﬁguration from this admissible set. 4.3 Mo del ﬁttin g and ev aluation All mo dels are ﬁtted in a Bay esian framework using the latent Gaussian struc- ture describ ed in Section 4.1 and the PC priors and prior predictiv e calibration in Section 4.2 . In what follows, we distinguish b et w een r e gular-se ason p ossessions, whic h are used for mo del ﬁtting, and p ost-se ason p ossessions, which are used only for out-of-sample ev aluation (see Section 2.1 for details of the dataset construction). 4.3.1 Data split (ﬁtting and ev aluation) W e train all mo dels on regular-season p ossessions and ev aluate their out-of-sample p erformance on p ost-season p ossessions, reﬂecting the intended deploymen t scenario in whic h a model ﬁtted on past seasons is used to assess p ost-season tendencies. In total, the dataset contains 306 , 444 ﬁeld-goal attempts, of which 296 , 768 shots ( 96 . 84% ) o ccur in the regular season and 9 , 676 shots ( 3 . 16% ) o ccur in the p ost-season. Regular-season shots con tribute to the likelihoo d used for parameter estimation, whereas post-season shots are held out and used only for out-of-sample predictiv e ev aluation. 4.3.2 Mo del conﬁgurations W e compare a family of 16 candidate sp eciﬁcations obtained by switching on or oﬀ four groups of random eﬀects. Concretely , we in tro duce binary indicators 17 use_ta , use_ts , use_jump , use_step ∈ { 0 , 1 } con trolling the inclusion of team × area eﬀects w ias ( use_ta ), team × side eﬀects z ids ( use_ts ), and shot-type-sp eciﬁc random slop es for jump shots ( use_jump ) and step-back / pull-up jump shots ( use_step ). W e then consider all combinations { use_ta , use_ts , use_jump , use_step } ∈ { 0 , 1 } 4 , (20) so that the most complex sp eciﬁcation uses all four components and the simplest one includes only the ﬁxed eﬀects and the team- and area-lev el main eﬀects. Random slop es are introduced only for the jump_shot and step_pull categories. These shot-t yp es ha ve suﬃcien tly many attempts per team to supp ort additional parameters, and they exhibit substantially larger spatial v ariability than other cate- gories, making them natural candidates for capturing team-sp eciﬁc spatial tendencies. F or the remaining shot-t yp es w e retain only ﬁxed eﬀects, whic h helps av oid o ver- parameterization and instability in cells c with few observ ations. All candidate mo dels share the same ﬁxed-eﬀect structure and the same family of h yp erpriors; only the presence or absence of these random-eﬀect blo c ks diﬀers. 4.3.3 Fitting via INLA F or each mo del conﬁguration, we aggregate the play-b y-pla y data to the cell level c and form regular-season counts Y reg c , p ost-season counts Y post c , and team-by-season exp osure terms E reg is and E post is . The mo del in ( 2 ) is then ﬁtted to the regular-season data by treating Y reg c as the resp onse and log E reg is as the oﬀset. W e use a Poisson lik eliho o d with a log link and exploit the latent Gaussian structure to p erform approx- imate Bay esian inference via INLA, as implemented in the R-INLA pac k age. F or each ﬁt w e request p osterior summaries for all ﬁxed and random eﬀects and for the lin- ear predictors η c , and enable conﬁguration sampling ( config = TRUE ) so that we can dra w p osterior samples from the latent Gaussian ﬁeld using inla.posterior.sample . These p osterior draws are later used to ev aluate out-of-sample predictive p erformance on the p ost-season data via Monte Carlo integration of the log predictive densit y . All random-eﬀect blo cks are implemented as indep enden t Gaussian eﬀects with model="iid" in R-INLA . Seasonally correlated team × area eﬀects w ias and shot- t yp e-sp eciﬁc slop es r ( m ) ias are represented using the group mechanism with group = season and an exc hangeable correlation structure across seasons, corresp onding to the equicorrelation matrix R S ( ρ ) in Section 4.1 . The team × side eﬀects z ids are modeled as season-speciﬁc i.i.d. eﬀects without an explicit temp oral correlation, with zero- sum constraints ov er d for eac h ( i, s ) imp osed via the extraconstr argument. T o impro ve n umerical stability we add a small ridge term to the ﬁxed eﬀects (through control.fixed ) and use strategy="adaptive" and int.strategy="ccd" for the INLA internal appro ximations. 4.3.4 Criterion on ﬁtting (regular-season) data On the regular-season data used for ﬁtting, we monitor the log pseudo-marginal lik e- liho od (LPML; Geisser and Eddy ( 1979 ); Held, Schrödle, and Rue ( 2009 )). LPML is 18 deﬁned as LPML reg = X c log  CPO c  , (21) where CPO c denotes the conditional predictive ordinate for cell c (regular-season observ ation), as returned by INLA. Larger v alues of LPML reg corresp ond to b etter in- sample predictive ﬁt. In the result tables w e rep ort LPML reg as a secondary diagnostic to c heck that the mo dels selected by the p ost-season criterion are also reasonable on the ﬁtting data. 4.3.5 Out-of-sample ev aluation on the p ost-season Our primary mo del-comparison criterion is the exp ected log predictiv e densit y on the p ost-season data. F or each ﬁtted mo del we keep the p osterior obtained from regular- season data and ev aluate predictive performance on the held-out p ost-season counts Y post c with exp osure E post is . Let θ denote the collection of ﬁxed and random eﬀects and h yp erparameters; for each cell c with E post is > 0 we consider the log predictiv e densit y log p  Y post c | θ  , (22) where the Poisson mean is µ post c = E post is exp( η c ) . F rom each INLA ﬁt, we dra w J p osterior samples θ (1) , . . . , θ ( J ) of the latent Gaus- sian ﬁeld (here J = 400 ), and approximate the p oin twise log predictive densities by Mon te Carlo, b ℓ c = log 1 J J X j =1 p  Y post c | θ ( j )  ! . (23) Summing o ver all p ost-season cells yields the Monte Carlo estimate of the exp ected log predictive densit y , d elp d MC post = X c b ℓ c . (24) W e use d elp d MC post as the primary selection criterion, with larger v alues indicating better out-of-sample predictive p erformance on p ost-season shot c harts. In the result tables w e rep ort d elp d MC post together with LPML reg to jointly assess extrap olativ e performance and ﬁt to the regular-season data. All computations are carried out in R using a single BLAS thread for repro- ducibilit y . With the presen t dataset (on the order of 3 × 10 5 shots aggregated to team-season-area-side-shot-t yp e cells), each INLA ﬁt completes w ithin roughly a min ute on a standard workstation-class CPU. 19 4.3.6 A dditional team-level shot-t yp e slop es In addition to the main family of 16 candidate mo dels based on team × area and shot-t yp e-sp eciﬁc slopes for jump and step-bac k/pull-up shots (Section 4.3 ), we also considered an extended set of mo dels that in tro duce team-level shot-type slop es for the remaining categories ( lay_up , floater , rim_finishes , fade_turn ). Concretely , w e kept the team × area slop es for jump_shot and step_pull unchanged and switched on/oﬀ the presence of team-level slop es for each of the four additional shot types, com bined with use_ts ∈ { 0 , 1 } , yielding 2 × 2 4 = 32 additional candidates. The prior h yp erparameters were ﬁxed to the v alues calibrated in Section 4.2 , with U sd = 1 . 5 , U slope = 1 . 0 , α prec = 0 . 05 and V = 0 . 7 , α cor = 0 . 7 . W e ev aluated these extended mo d- els in the same w ay as the main candidates, using d elp d MC post on p ost-season data and LPML reg on regular-season data. Comp eting in terests The authors hav e no comp eting interests to declare that are relev ant to the conten t of this article. Author Con tributions K.Y. contributed to the study conception and design. Data preparation, mo deling, and analysis were p erformed by K.Y. The ﬁrst draft of the manuscript was con tributed b y K.Y. and K.F. All authors read and approv ed the ﬁnal man uscript. Data a v ailability The data of the researc h is obtained by participating in a comp etition hosted by the academic organizations. The central idea of this study was indep enden t of the comp etition (not restricted by the comp etition). Data acquisition was based on the con tract betw een the basketball league and Data Stadium, Inc., but not b et w een the pla yers/teams and us. F unding Information K. Y amada gratefully ackno wledges supp ort from the “THERS Make New Standards Program for the Next Generation Researchers”. This w ork was ﬁnancially supported b y JSPS KAKENHI Gran t Num b er 23H03282. A ckno wledgments. The data used in this study w ere pro vided by the Insti- tute of Statistical Mathematics, Academ y for Statistics and Data Science, Research Organization of Information and Systems, and Data Stadium Inc. 20 References Anzer, G., & Bauer, P . (2021). A goal scoring probability mo del for shots based on sync hronized p ositional and ev ent data in football (so ccer). F r ontiers in Sp orts and A ctive Living , V olume 3 - 2021 , , https://doi.org/10.3389/fspor .2021.624475 Brec hot, M., & Flepp, R. (2020). Dealing with randomness in matc h outcomes: Ho w to rethink p erformance ev aluation in europ ean club fo otball using exp ected goals. Journal of Sp orts Ec onomics , 21 (4), 335-362, https://doi.org/10.1177/ 1527002519897962 Cao, J., Cai, Q., W aller, L.A., Hickson, D.A., Hu, G., Kang, J. (2025). What inﬂuenc es the ﬁeld go al attempts of pr ofessional players? analysis of b asketb al l shot charts via lo g gaussian c ox pr o c esses with sp atial ly varying c o eﬃcients. Retrieved from h Cao, J., Y ang, H.-C., Hu, G. (2025). How do the pr ofessional players sele ct their shot lo c ations? an analysis of ﬁeld go al attempts via b ayesian additive r e gr ession tr e es. Retrieved from h Ca vus, M., & Biecek, P . (2022). Explainable expected goal mo dels for p erformance analysis in fo otball analytics. 2022 ie e e 9th international c onfer enc e on data scienc e and advanc e d analytics (dsaa) (p. 1-9). Chen, T., & Guestrin, C. (2016). Xgb o ost: A scalable tree b o osting system. (p. 785–794). New Y ork, NY, USA: Association for Computing Machinery . Retriev ed from h ttps://doi.org/10.1145/2939672.2939785 Eggels, H., V an Elk, R., P echenizkiy , M. (2016). Explaining so ccer matc h outcomes with goal scoring opportunities predictive analytics. 3r d workshop on machine le arning and data mining for sp orts analytics (mlsa 2016). Ehrlic h, J., & Sanders, S. (2024). Estimating nba team shot selection eﬃciency from aggregations of true, contin uous shot c harts: A generalized additiv e mo del approac h. Continuous Shot Charts: A Gener alize d A dditive Mo del Appr o ach (January 16, 2024) , , F airchild, A., P elechrinis, K., K okkodis, M. (2018). Spatial analysis of shots in mls: A mo del for exp ected goals and fractal dimensionality . Journal of Sp orts Analytics , 4 (3), 165-174, https://doi.org/10.3233/JSA-170207 21 Gabry , J., Simpson, D., V eh tari, A., Betancourt, M., Gelman, A. (2019, 01). Visual- ization in bay esian w orkﬂow. Journal of the R oyal Statistic al So ciety Series A: Statistics in So ciety , 182 (2), 389-402, https://doi.org/10.1111/rssa.12378 Geisser, S., & Eddy , W.F. (1979). A predictive approac h to mo del selection. Journal of the A meric an Statistic al Asso ciation , 74 (365), 153–160, Held, L., Schrödle, B., Rue, H. (2009). Posterior and cross-v alidatory predictiv e chec ks: a comparison of mcmc and inla. Statistic al mo del ling and r e gr ession structur es: F estschrift in honour of ludwig fahrmeir (pp. 91–110). Springer. Hewitt, J.H., & Karakuş, O. (2023). A machine learning approach for play er and p osition adjusted expected goals in fo otball (soccer). F r anklin Op en , 4 , 100034, h ttps://doi.org/https://doi.org/10.1016/j.fraope.2023.100034 Retrieved from h ttps://www.sciencedirect.com/science/article/pii/S2773186323000282 Ho, K.-W. (2025). A ba yesian negative binomial-b ernoulli model with tensor decom- p osition: Application to jointly analyzing shot attempts and shot successes in bask etball games. Journal of Data Scienc e , 1–15, h ttps://doi.org/10.6339/ 25-JDS1196 Hu, G., Y ang, H.-C., Xue, Y. (2021). Bay esian group learning for shot selection of professional basketball pla yers. Stat , 10 (1), e324, Jiao, J., Hu, G., Y an, J. (2021). A ba yesian marked spatial p oin t processes mo del for bask etball shot chart. Journal of Quantitative Analysis in Sp orts , 17 (2), 77–90, Kam bhamettu, A.R., Shriv astav a, A., Gwilliam, M. (2024). Quan tifying nba shot qualit y: A deep net work approach. Pr o c e e dings of the 7th acm international workshop on multime dia c ontent analysis in sp orts (pp. 91–95). Lucey , P ., Bialk owski, A., Mon fort, M., Carr, P ., Matthews, I. (2015). qualit y vs quantit y: Improv ed shot prediction in so ccer using strategic features from spatiotemp oral data. Mead, J., O’Hare, A., McMenem y , P . (2023, 04). Expected goals in fo ot- ball: Improving mo del p erformance and demonstrating v alue. PLOS ONE , 18 (4), 1-29, h ttps://doi.org/10.1371/journal.p one.0282295 Retriev ed from h ttps://doi.org/10.1371/journal.p one.0282295 22 Miller, A., Bornn, L., Adams, R., Goldsb erry , K. (2014). F actorized p oin t pro cess in tensities: A spatial analysis of professional bask etball. International c onfer enc e on machine le arning (pp. 235–243). Møller, J., Syversv een, A.R., W aagep etersen, R.P . (1998). Log gaussian cox pro cesses. Sc andinavian journal of statistics , 25 (3), 451–482, On uma, K. (2025, Ma y 1). Cs p articip ating te ams intr o duction 1. utsunomiya br ex “over c oming the obstacles and sadness of last se ason to r e claim the championship”. B.MAGAZINE , B.LEA GUE oﬃcial w ebsite. Retriev ed from h ttps://www.bleague.jp/bmagazine/detail/id=499990 (In Japanese. Accessed 4 December 2025) P ardo, P .M. (2020). Cr e ating a mo del for exp e cte d go als in fo otb al l using qualitative player information (Unpublished do ctoral dissertation). Universitat Politècnica de Cataluny a. F acultat d’Informàtica de Barcelona. Rathk e, A. (2017). An examination of exp ected goals and shot eﬃciency in so ccer. Journal of Human Sp ort and Exer cise , 12 (2), 514–529, Reic h, B.J., Ho dges, J.S., Carlin, B.P ., Reich, A.M. (2006). A spatial analysis of bask etball shot c hart data. The Americ an Statistician , 60 (1), 3–12, Rue, H., Martino, S., Chopin, N. (2009). Approximate ba yesian inference for latent gaussian mo dels b y using integrated nested laplace appro ximations. Journal of the R oyal Statistic al So ciety: Series B (Statistic al Metho dolo gy) , 71 (2), 319-392, h ttps://doi.org/https://doi.org/10.1111/j.1467-9868.2008.00700.x Sandholtz, N., Mortensen, J., Bornn, L. (2020). Measuring spatial allocative eﬃciency in basketball. Journal of Quantitative Analysis in Sp orts , 16 (4), 271– 289, https://doi.org/doi:10.1515/jqas-2019-0126 Retriev ed 2025-10-14, from h ttps://doi.org/10.1515/jqas-2019-0126 Sc hmid, M., Schöpf, M., K olbinger, O. (2025). Getting nba shots in con text: Analysing bask etball shots with graph embeddings. Journal homep age: http://iacss. or g/index. php? id , 24 (1), , Sc holtes, A., & Karakuş, O. (2024). Ba yes-xg: play er and p osition correction on exp ected goals (xg) using bay esian hierarchical approach. F r ontiers in Sp orts and 23 A ctive Living , V olume 6 - 2024 , , https://doi.org/10.3389/fspor.2024.1348983 Scrucca, L., & Karlis, D. (2025). A mo del-based approach to shot charts estimation in basketball. Computational Statistics , 40 (4), 2031–2048, Simpson, D., Rue, H., Riebler, A., Martins, T.G., Sørby e, S.H. (2017). P enalising mo del comp onen t complexity: A principled, practical approac h to constructing priors. T ureen, T., & Olthof, S. (2022). “estimated pla yer impact”(epi): Quantifying the eﬀects of individual play ers on fo otball (so ccer) actions using hierarc hical statistical mo dels. Statsb omb c onfer enc e pr o c e e dings. W ong-T oi, E., Y ang, H.-C., Shen, W., Hu, G. (2023). A joint analysis for ﬁeld goal attempts and p ercentages of professional bask etball play ers: Bay esian nonparametric resource. Journal of Data Scienc e , 21 (1), , Xu, Y., Bretzner, L., W ang, T., Maki, A. (2025). Skor-xg: Sk eleton-oriented exp ected goal estimation in so ccer. Pr o c e e dings of the c omputer vision and p attern r e c o gnition c onfer enc e (pp. 5967–5977). Y amada, K., & F ujii, K. (2025). T wo clusterings to capture bask etball pla yers’ sho ot- ing tendencies using tracking data: clustering of sho oting st yles and the shots themselv es. International Journal of Computer Scienc e in Sp ort , 24 (1), 35–55, h ttps://doi.org/10.2478/ijcss-2025-0003 Y eung, C., Sit, T., F ujii, K. (2025). T ransformer-based neural marked spatio temp oral p oin t process mo del for analyzing football match even ts. Applie d Intel ligenc e , 55 (5), 335, Yin, F., Hu, G., Shen, W. (2023). Analysis of professional bask etball ﬁeld goal attempts via a ba yesian matrix clustering approac h. Journal of Computational and Gr aphic al Statistics , 32 (1), 49–60, Zuccolotto, P ., Sandri, M., Manisera, M. (2021). Spatial p erformance indicators and graphs in basketball. So cial Indic ators R ese ar ch , 156 (2), 725–738, 24 Zuccolotto, P ., Sandri, M., Manisera, M. (2023). Spatial p erformance analysis in bask etball w ith cart, random forest and extremely randomized trees. Annals of Op er ations R ese ar ch , 325 (1), 495–519, 25

STAMP: A shot-type-aware areal multilevel Poisson model for league-wide comparison of basketball shot charts

Original Paper

Comments & Academic Discussion

Leave a Comment

Original Paper

Related Papers

Comments & Academic Discussion

Leave a Comment