Learn about our methodology, data sources, and the team behind Property Analytics London.
Property Analytics uses a cross-sectional factor-based approach to analyse the London real estate market, similar to those widely used in equity market analysis. This methodology allows us to decompose property returns into specific factors that drive market prices.
Our model was developed through rigorous statistical analysis of London property transaction data from 1995. We employed multiple regression techniques to identify the key factors that consistently explain variations in property prices across different market cycles.
Decomposing returns into factors can be done in mainly three ways: a purely statistical approach, time-series regression, and cross-sectional regression. The first one is hard to interpret, and the second one requires knowing the factor returns a priori. The third one is the most powerful and therefore widely used for stocks. All it requires is knowing the characteristics of each property and regressing the prices on those to derive the factor returns. Fortunately, this is very easy for properties where we can use, for example, the number of rooms, but it's somewhat harder for stocks.
At each month t, we fit a cross-sectional OLS of log(price/sqm) on property characteristics, each centred on its window mean X̄j,window — the mean of Xj over the transactions in the rolling 3-month window:
\[ y_i \;=\; \alpha_t \;+\; \sum_{j=1}^{k} \beta_{t,j}\bigl(X_{i,j} - \bar{X}_{j,\text{window}}(t)\bigr) \;+\; \varepsilon_i \]
Centring makes the intercept αt interpretable as the log-price of the average transaction in the current window. Each βt,j is the premium paid per unit of characteristic j at time t.
The period-on-period change in quality-adjusted log-price decomposes via a first-order Taylor expansion evaluated at t:
\[ \Delta \hat{y}(t) \;=\; \Delta\alpha_t \;+\; \sum_{j=1}^{k} \Bigl[\underbrace{\beta_{t,j}\,\Delta X_j}_{\text{composition}} \;+\; \underbrace{X_j(t)\,\Delta \beta_{t,j}}_{\text{factor return}} \;-\; \underbrace{\Delta \beta_{t,j}\,\Delta X_j}_{\text{cross term}}\Bigr] \]
The first two terms use current-period weights (βt, Xt). This is not an exact first-order decomposition of d(β·X) — it over-counts by Δβ·ΔX. We track that cross term explicitly as its own column in factor_returns.csv so the decomposition closes:
By construction Baseline + Σ Factors + Cross Term exactly equals the cumulative change in ŷ(t). The cross term is usually small in practice (second-order in monthly changes) but is plotted explicitly on the Trends page so you can see the bias it would introduce under a naive current-values split.
We started with a comprehensive set of potential factors and systematically eliminated those with low explanatory power or high correlation with other factors. The final model includes 8 factors that collectively provide a robust framework for understanding London property prices:
We validate our model using rigorous statistical tests, like ratio between residuals and factor returns, low factor cross-correlations, and low autocorrelations. We also do out-of-sample testing, comparing its predictions against actual market transactions.
A key choice in any hedonic index is the reference basket: the average property whose characteristics are used to translate fitted betas into a price level. We use the actual window mean of each characteristic at time t, so the basket evolves with what is being transacted rather than being frozen on a single representative property.
This decomposition gives two complementary readings per factor: a factor return (Xt·Δβj, pure market repricing of characteristic j at today's basket weight) and a composition term (βt·ΔXj, the contribution from the basket itself shifting). The composition piece is rolled into Baseline so the factor-return chart stays a clean repricing signal; the Factors page exposes both views per factor.
The basket has shifted materially over 30 years. The charts below track the four dimensions that move the most in the underlying transactions, each expressed as a share of monthly volume:
For reference, the table below summarises the average sold property in four illustrative London market eras:
| Characteristic | 1995–2007 | 2008–2015 | 2016–2019 | 2020–present |
|---|---|---|---|---|
| Market era | Pre-crisis | Post-GFC / Help to Buy | Help to Buy peak / Brexit | COVID space race |
| % Flat | 53% | 56% ▲ | 60% ▲▲ | 57% ▼ |
| % Freehold | 50% | 46% ▼ | 43% ▼▼ | 46% ▲ |
| % New build | 8% | 10% ▲ | 16% ▲▲ | 10% ▼ |
| EPC A/B rated | 2% | 7% ▲ | 15% ▲▲ | 16% = |
| Avg. rooms | 3.9 | 3.8 ▼ | 3.8 = | 3.8 = |
| Construction period | Pre-1960s | Mixed ▲ | Mixed = | Post-1990s ▲ |
| Floor area | 79 sqm | 80 sqm = | 79 sqm = | 80 sqm = |
The 2016 to 2019 period stands out: Help to Buy drove the highest share of new builds and flats ever recorded in London transactions, while EPC standards pushed A/B rated properties from 2% to 15% of sales. Because the regression uses the live basket at each window, these compositional shifts get attributed to the composition term per factor and rolled into Baseline, rather than contaminating the factor returns.
|
One way we assess the robustness of the factor model is by looking at the pairplot of its factors. We make sure their correlation is low, which is an indication of good quality factors. One the left-hand side we show one of these plots indicating the correlation is generally very low. |
|
Our analysis is based on a comprehensive dataset of London property transactions and related information from multiple authoritative sources:
The raw data underwent extensive cleaning, normalization, and integration to create a unified dataset suitable for factor analysis. We employed various statistical techniques to handle missing values, outliers, and data inconsistencies.
While our dataset is comprehensive, it has certain limitations:
Property Analytics London was developed by a team of data scientists, economists, and real estate professionals passionate about bringing data-driven insights to the London property market.
Our team combines expertise in:
We believe that better data leads to better decisions. Our mission is to democratize access to sophisticated property market analysis, helping homebuyers, investors, and property professionals make more informed decisions in the London real estate market.
Have questions about our methodology, data sources, or upcoming property analysis tool? We'd love to hear from you.