Parameter Estimation of the Ideal Distribution of Visual Meteor Magnitudes

Introduction

The density of the ideal distribution of meteor magnitudes is \[ {\displaystyle f(m) = \frac{\mathrm{d}p}{\mathrm{d}m} = \frac{3}{2} \, \log(r) \sqrt{\frac{r^{3 \, \psi + 2 \, m}}{(r^\psi + r^m)^5}}} \] where $m$ denotes the continuous (real-valued) meteor magnitude, $r = 10^{0.4} \approx 2.51189 \dots$ is a constant, and $\psi$ is the only parameter of this magnitude distribution.
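
As a quick plausibility check, the density can be transcribed directly from the formula above and integrated numerically. This is a minimal sketch in base R; the helper name f_ideal and the value $\psi = 6$ are illustrative assumptions and not part of any package API.

```r
# Constant r = 10^0.4 from the definition above
r <- 10^0.4

# Density of the ideal magnitude distribution, transcribed from the formula.
# `f_ideal` is a hypothetical helper name used only in this sketch.
f_ideal <- function(m, psi) {
    1.5 * log(r) * sqrt(r^(3 * psi + 2 * m) / (r^psi + r^m)^5)
}

psi <- 6.0 # illustrative value, not an estimate

# Integrate over a wide finite range around psi; the tails beyond it are negligible
total <- integrate(f_ideal, lower = psi - 25, upper = psi + 25, psi = psi)$value
print(total) # should be very close to 1
```

A finite integration range is used on purpose: for very large $m$ the powers of $r$ overflow in floating-point arithmetic, so integrating this literal form over the whole real line may fail.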

In visual meteor observations, magnitudes are usually estimated as integer values. Hence, this distribution is discrete and its probability mass function is given by \[ P[M = m] \sim \begin{cases} g(m_{\mathrm{lim}} - m) \displaystyle \int\limits_{m-0.5}^{m+0.5} f(u) \, \mathrm{d}u, & \text{if } m_{\mathrm{lim}} - m > -0.5,\\[5pt] 0 & \text{otherwise,} \end{cases} \] where $m_{\mathrm{lim}}$ denotes the limiting (non-integer) magnitude of the observation, and $m$ the integer meteor magnitude. The function $f(\cdot)$ is the continuous density of the ideal magnitude distribution, and $g(\cdot)$ denotes the perception probability function.
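
To illustrate how this probability mass function is assembled, the sketch below integrates the density over each integer bin, weights the result with a perception probability, and normalizes. The function g_placeholder is a purely hypothetical stand-in (a logistic curve) and is not the perception function of the model; f_ideal, mass_one and pmf_sketch are likewise names invented for this example.

```r
r <- 10^0.4

# Continuous density of the ideal distribution (see the formula above)
f_ideal <- function(m, psi) {
    1.5 * log(r) * sqrt(r^(3 * psi + 2 * m) / (r^psi + r^m)^5)
}

# HYPOTHETICAL perception probability: a logistic placeholder, NOT the real g()
g_placeholder <- function(d) plogis(d, location = 2, scale = 1)

# Unnormalized mass for one integer magnitude m at limiting magnitude lm
mass_one <- function(m, lm, psi) {
    if (lm - m <= -0.5) return(0) # fainter than the limit: not observable
    bin <- integrate(f_ideal, lower = m - 0.5, upper = m + 0.5, psi = psi)$value
    g_placeholder(lm - m) * bin
}

# Normalized probability mass function over a range of integer magnitudes
pmf_sketch <- function(m, lm, psi) {
    w <- vapply(m, mass_one, numeric(1), lm = lm, psi = psi)
    w / sum(w)
}

m <- -10:8
p <- pmf_sketch(m, lm = 5.6, psi = 7.0)
print(round(setNames(p, m), 4))
```

With a limiting magnitude of 5.6, magnitude class 6 still receives a positive probability because $m_{\mathrm{lim}} - m = -0.4 > -0.5$, whereas classes 7 and 8 are assigned zero.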

Here we demonstrate two methods to estimate the parameter $\psi$.

First, we obtain some magnitude observations from the example data set, which also includes the limiting magnitude.

observations <- with(PER_2015_magn$observations, {
    idx <- !is.na(lim_magn) & sl_start > 135.81 & sl_end < 135.87
    data.frame(
        magn_id = magn_id[idx],
        lim_magn = lim_magn[idx]
    )
})
head(observations, 5) # Example values

magn_id	lim_magn
225413	5.30
225432	5.95
225438	6.01
225449	6.48
225496	5.50

Next, the observed meteor magnitudes are matched with the corresponding observations. This is necessary as we need the limiting magnitudes of the observations to determine the parameter.

Using

magnitudes <- with(new.env(), {
    magnitudes <- merge(
        observations,
        as.data.frame(PER_2015_magn$magnitudes),
        by = "magn_id"
    )
    magnitudes$magn <- as.integer(as.character(magnitudes$magn))
    subset(magnitudes, (magnitudes$lim_magn - magnitudes$magn) > -0.5)
})
head(magnitudes[magnitudes$Freq > 0, ], 5) # Example values

we obtain a data frame with the absolute observed frequencies Freq for each magnitude class of each observation. The expression subset(magnitudes, (magnitudes$lim_magn - magnitudes$magn) > -0.5) ensures that meteors fainter than the limiting magnitude, if any exist, are excluded.

	magn_id	lim_magn	magn	Freq
9	225413	5.30	4	1.0
11	225413	5.30	1	2.0
14	225413	5.30	3	3.0
15	225432	5.95	4	2.0
17	225432	5.95	3	1.5

This data frame contains a total of 97 meteors. This is a sufficiently large number to estimate the parameter.

Maximum Likelihood Method

The maximum likelihood method can be used to estimate the parameter in an asymptotically unbiased manner. For this, the function dvmideal() is needed, which returns the probability mass (the discrete density) of the observable meteor magnitudes when the parameter and the limiting magnitudes are known.

The following algorithm estimates the parameter by maximizing the likelihood with the optim() function. The function ll() returns the negative log-likelihood, as optim() identifies a minimum.

# maximum likelihood estimation (MLE) of psi
result <- with(magnitudes, {
    # negative log-likelihood function (optim() searches for a minimum)
    ll <- function(psi) -sum(Freq * dvmideal(magn, lim_magn, psi, log = TRUE))
    psi_start <- 6.0 # starting value
    psi_lower <- 4.0 # lowest expected value
    psi_upper <- 10.0 # highest expected value
    # find minimum
    optim(psi_start, ll, method = "Brent", lower = psi_lower, upper = psi_upper, hessian = TRUE)
})

This gives the expected value and the variance of the parameter:

psi_mean <- result$par # mean of psi
print(psi_mean)
#> [1] 6.116217
psi_var <- 1 / result$hessian[1, 1] # variance of psi
print(psi_var)
#> [1] 0.3335565
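
To see the optim() recipe in isolation, the following sketch applies it to simulated data drawn from the continuous density $f(m)$, ignoring perception probabilities and integer rounding. The closed-form distribution function used by the sampler is derived here from the density formula and is an assumption of this sketch, as are all helper names; it does not show how dvmideal() works internally.

```r
set.seed(1)
r <- 10^0.4

# Log-density of the continuous ideal distribution, rewritten with
# x = r^(m - psi) as f = 1.5 * log(r) * x / (1 + x)^(5/2)
log_f_ideal <- function(m, psi) {
    z <- (m - psi) * log(r) # z = log(x)
    log(1.5 * log(r)) + z - 2.5 * log1p(exp(z))
}

# Inverse-CDF sampler, ASSUMING F(m) = 1 - (1 + r^(m - psi))^(-3/2)
r_ideal <- function(n, psi) {
    u <- runif(n)
    psi + log((1 - u)^(-2 / 3) - 1) / log(r)
}

psi_true <- 6.0
m_sim <- r_ideal(2000, psi_true)

# Negative log-likelihood, because optim() minimizes
nll <- function(psi) -sum(log_f_ideal(m_sim, psi))
fit <- optim(6.5, nll, method = "Brent", lower = 2, upper = 12, hessian = TRUE)

psi_hat <- fit$par
psi_se <- sqrt(1 / fit$hessian[1, 1]) # standard error from the curvature
print(c(estimate = psi_hat, se = psi_se))
```

With 2000 simulated magnitudes the estimate should land close to psi_true. The real data set is far smaller and is additionally censored by the limiting magnitude, so its likelihood behaves less favorably.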

We can additionally visualize the likelihood function here.

with(new.env(), {
    data_plot <- data.frame(psi = seq(4.0, 11, 0.1))
    data_plot$ll <- mapply(function(psi) {
        with(magnitudes, {
            # log likelihood function
            sum(Freq * dvmideal(magn, lim_magn, psi, log = TRUE))
        })
    }, data_plot$psi)
    data_plot$l <- exp(data_plot$ll - max(data_plot$ll))
    data_plot$l <- data_plot$l / sum(data_plot$l)
    plot(data_plot$psi, data_plot$l,
        type = "l",
        col = "blue",
        xlab = "psi",
        ylab = "likelihood"
    )
    abline(v = result$par, col = "red", lwd = 1)
})

It is clearly visible that the likelihood function does not have the shape of a normal distribution. It even belongs to the class of heavy-tailed distributions. While its maximum is indeed an asymptotically unbiased estimator of $\psi$, this does not hold for the variance. This is important in this context because the variance of the estimated $\psi$-value is derived from the curvature (the second derivative at the maximum) of the log-likelihood function, an approximation that is only adequate when the likelihood is close to normal. Therefore, the estimator for the variance of $\psi$ is far too small.

Variance-Stabilizing Transformation as an Alternative Method

Estimation based on the maximum likelihood principle is computationally demanding. As an alternative, a variance-stabilizing transformation can be applied. This transformation maps meteor magnitudes onto a different scale, yielding a distribution whose variance no longer depends on the parameter $\psi$.

The variance-stabilizing transformation has the following additional advantages:

Since the variance does not depend on $\psi$, the accuracy of estimators (e.g., maximum likelihood estimators) is easier to assess,
The variance bound (Cramér–Rao lower bound) then often depends only on the sample size n, but not on the true value of $\psi$,
Estimators such as the sample mean are homoscedastic, i.e., their dispersion is constant across the parameter space. As a result, many classical results of linear estimation theory (BLUE properties, Gauss–Markov theorem) hold without additional transformations,
Because the variance is fixed, the calculation of standard errors is independent of the unknown value of $\psi$,
Confidence intervals can be more easily standardized and are equally well calibrated across the entire parameter space,
Test statistics (e.g., likelihood ratio or score tests) exhibit a more uniform distribution, since the variance does not need to be treated as an additional unknown. This improves test power and simplifies asymptotic approximations.

The resulting procedure is straightforward: it suffices to compute the mean of the transformed meteor magnitudes, from which an estimate of the parameter $\psi$ is obtained.

Note that the variance-stabilizing transformation yields a mean that can be directly analyzed, but converting it back to the $\psi$ parameter requires applying the delta method. This accounts for the nonlinearity of the transformation and provides appropriate uncertainty estimates for $\psi$.
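
The delta method mentioned above can be sketched generically: for a smooth back-transformation $h$, the variance of $h$ applied to the mean is approximately the squared derivative of $h$ at the mean times the variance of the mean. In the sketch below, delta_method_var is a hypothetical helper, the derivative is approximated by a central difference, and exp() merely stands in for the actual back-transformation.

```r
# First-order delta method: Var[h(T)] ~ h'(t)^2 * Var[T], evaluated at t = E[T].
# `delta_method_var` is a hypothetical helper name, not a package function.
delta_method_var <- function(h, t_mean, t_var, eps = 1e-5) {
    # central finite-difference approximation of h'(t_mean)
    dh <- (h(t_mean + eps) - h(t_mean - eps)) / (2 * eps)
    dh^2 * t_var
}

# Stand-in example with h = exp, where h'(t) = exp(t) is known exactly
t_mean <- 0.5
t_var <- 0.01
v <- delta_method_var(exp, t_mean, t_var)
print(c(numeric = v, exact = exp(t_mean)^2 * t_var))
```

In the setting of this section, h would be the mapping from the mean of tm to $\psi$ at a fixed mean limiting magnitude, for example function(t) vmideal_vst_to_psi(t, lim_magn_mean). Because this mapping becomes strongly nonlinear as the mean approaches 0, a first-order approximation of this kind should be treated with caution there.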

tm_mean <- with(magnitudes, {
    N <- sum(Freq)
    tm <- vmideal_vst_from_magn(magn, lim_magn)
    tm_mean <- sum(Freq * tm) / N
    tm_var <- sum(Freq * (tm - tm_mean)^2) / (N - 1)
    tm_mean_var <- tm_var / N
    list("val" = tm_mean, "var" = tm_mean_var, "sd" = sqrt(tm_mean_var))
})

Thus, one obtains the mean and the variance of the mean of tm.

print(paste("tm mean:", tm_mean$val))
#> [1] "tm mean: 0.0878019525445456"
print(paste("tm var:", tm_mean$var))
#> [1] "tm var: 0.00836015180816983"

Using the bootstrap method, it can be assessed whether the mean is normally distributed.

tm_means <- with(magnitudes, {
    N <- sum(Freq)
    tm <- vmideal_vst_from_magn(magn, lim_magn)
    replicate(50000, {
        mean(sample(tm, size = N, replace = TRUE, prob = Freq))
    })
})

The graphical representation indicates that this is indeed approximately the case.

with(new.env(), {
    tm_min <- tm_mean$val - 3 * tm_mean$sd
    tm_max <- tm_mean$val + 3 * tm_mean$sd
    tm_means <- subset(tm_means, tm_means > tm_min & tm_means < tm_max)
    brks <- seq(min(tm_means) - 0.02, max(tm_means) + 0.02, by = 0.02)
    hist(tm_means,
        breaks = brks,
        col = "skyblue",
        border = "black",
        main = "Histogram of mean tm",
        xlab = "tm",
        ylab = "count",
        xaxt = "n"
    )
    axis(1, at = seq(round(min(brks), 1), round(max(brks), 1) + 0.1, by = 0.1))
    abline(v = 0, col = "red", lwd = 1)
})
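
The bootstrap can also be cross-checked numerically: the standard deviation of the bootstrapped means should roughly agree with the analytic standard error of the weighted mean. The sketch below uses made-up values for tm and Freq (they are not taken from the data set) to show the comparison.

```r
set.seed(42)

# Made-up transformed magnitudes and frequencies, for illustration only
tm <- c(-1.2, -0.6, -0.1, 0.3, 0.8, 1.4)
Freq <- c(5, 12, 20, 25, 14, 4)
N <- sum(Freq)

# Analytic standard error of the frequency-weighted mean
tm_mean <- sum(Freq * tm) / N
tm_var <- sum(Freq * (tm - tm_mean)^2) / (N - 1)
se_analytic <- sqrt(tm_var / N)

# Bootstrap: resample N values with probabilities proportional to Freq
boot_means <- replicate(5000, {
    mean(sample(tm, size = N, replace = TRUE, prob = Freq))
})
se_boot <- sd(boot_means)

print(c(analytic = se_analytic, bootstrap = se_boot))
```

The two values are expected to differ slightly, because the analytic formula divides by N - 1 while the bootstrap resamples from the empirical distribution.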

A mean value of 0.0 implies that $\psi$ lies at infinity. Negative values can be interpreted as a kind of “beyond infinity”, which is not meaningful. There are two possible explanations:

The distribution is not ideal, i.e., the observations do not fit the model as described above.
Random variation led to this result.

Although the expected value is clearly above 0.0 and can therefore be used, it is more appropriate in this context to interpret the value as the median.

lim_magn_mean <- with(magnitudes, {
    N <- sum(Freq)
    sum(Freq * lim_magn) / N
})
print(paste("lim_magn_mean:", lim_magn_mean))
#> [1] "lim_magn_mean: 5.65752577319588"
print(paste("mean psi:", vmideal_vst_to_psi(tm_mean$val, lim_magn_mean)))
#> [1] "mean psi: 7.00387357931987"

The mean limiting magnitude lim_magn_mean is required to estimate the location parameter $\psi$. In visual meteor observations, different limiting magnitudes usually occur. In practice, the observed quantity is the difference between $\psi$ and the limiting magnitude for each observation. Since $\psi$ is unknown, one must assume that the mean tm is correlated with the limiting magnitude.

If this correlation can be described by a simple linear regression, the mean limiting magnitude serves as a reliable reference point for estimating $\psi$. This is typically the case when the limiting magnitudes of the individual observations differ only slightly, or $\psi$ is smaller than or equal to the limiting magnitudes.

By contrast, if $\psi$ is significantly larger than the limiting magnitude, estimation becomes problematic: $\psi$ effectively tends to infinity, and the observable magnitude distribution approaches the geometric model of visual meteor magnitudes with a population index of $r \approx 2.5$.

Nevertheless, if all observations share the same limiting magnitude, the procedure yields an exact estimate.

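In practice, however, it is preferable to use a confidence interval estimate. Since larger values of the mean tm correspond to smaller values of $\psi$, the 90 percent quantile of the mean yields a lower bound. For example, one can estimate that, with an error probability of 10 percent, $\psi$ is not smaller than: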

print(vmideal_vst_to_psi(qnorm(0.90, tm_mean$val, tm_mean$sd), lim_magn_mean))
#> [1] 5.827726

Residual Analysis

So far, we have operated under the assumption that the real distribution of meteor magnitudes follows the ideal distribution and that the perception probabilities are accurate. We now use the Chi-Square goodness-of-fit test to check whether the observed frequencies match the expected frequencies. To this end, using the estimated parameter, we compute the expected relative frequencies p for each observation and add them to the data frame magnitudes:

psi_mean <- vmideal_vst_to_psi(tm_mean$val, lim_magn_mean)
magnitudes$p <- with(magnitudes, dvmideal(m = magn, lm = lim_magn, psi_mean))

We must also consider the probabilities for the magnitude class with the brightest meteors.

magn_min <- min(magnitudes$magn)

The smallest magnitude class magn_min is -6. In calculating the probabilities, we assume that the magnitude class -6 contains all meteors of magnitude -6 or brighter, and thus use the function pvmideal() to determine their probability.

idx <- magnitudes$magn == magn_min
magnitudes$p[idx] <- with(
    magnitudes[idx, ],
    pvmideal(m = magn + 1L, lm = lim_magn, psi_mean, lower.tail = TRUE)
)

This ensures that, for each observation, the probabilities of all magnitude classes sum to 100%. This is known as the normalization condition. The Chi-Square goodness-of-fit test will fail if this condition is not met.
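
One simple way to check the normalization condition is to sum the probabilities per observation. The sketch below uses a small made-up data frame with the same column names as magnitudes; the values are illustrative only.

```r
# Made-up probabilities for two observations (illustration only)
toy <- data.frame(
    magn_id = c("a", "a", "a", "b", "b", "b"),
    magn = c(1L, 2L, 3L, 1L, 2L, 3L),
    p = c(0.2, 0.3, 0.5, 0.1, 0.6, 0.3)
)

# Sum of probabilities per observation; each should equal 1
p_sums <- tapply(toy$p, toy$magn_id, sum)
print(p_sums)

# TRUE if every observation satisfies the normalization condition
normalized <- all(abs(p_sums - 1) < 1e-8)
print(normalized)
```

Applied to the data of this example, the analogous call would be tapply(magnitudes$p, magnitudes$magn_id, sum).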

We now create the contingency table magnitutes_observed for the observed meteor magnitudes and its margin table.

magnitutes_observed <- xtabs(Freq ~ magn_id + magn, data = magnitudes)
magnitutes_observed_mt <- margin.table(magnitutes_observed, margin = 2)
print(magnitutes_observed_mt)
#> magn
#>   -6   -5   -4   -3   -2   -1    0    1    2    3    4    5    6 
#>  0.0  0.0  0.0  0.0  3.0  4.0  7.0 10.0 23.0 26.5 20.0  3.0  0.5

Next, we check which magnitude classes need to be aggregated so that each contains at least 10 meteors, allowing us to perform a Chi-Square goodness-of-fit test.

The last output shows that meteors of magnitude class 0 or brighter must be combined into a magnitude class 0-. Meteors of magnitude 4 or fainter are grouped in the magnitude class 4+, and a new contingency table magnitutes_observed is created:

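# build the labels from the integer magnitudes so that no text values are compared
magn_class <- as.character(magnitudes$magn)
magn_class[magnitudes$magn <= 0] <- "0-"
magn_class[magnitudes$magn >= 4] <- "4+"
magnitudes$magn <- magn_class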
magnitutes_observed <- xtabs(Freq ~ magn_id + magn, data = magnitudes)
print(margin.table(magnitutes_observed, margin = 2))
#> magn
#>   0-    1    2    3   4+ 
#> 14.0 10.0 23.0 26.5 23.5
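
An alternative way to build such classes is cut(), which assigns labels from numeric breakpoints and keeps the magnitudes numeric until the labels are assigned. The following sketch uses made-up magnitudes; the breakpoints mirror the classes chosen above.

```r
# Made-up integer magnitudes (illustration only)
magn <- c(-3L, 0L, 1L, 2L, 3L, 4L, 6L)

# Intervals are right-closed: (-Inf, 0], (0, 1], (1, 2], (2, 3], (3, Inf]
magn_class <- cut(
    magn,
    breaks = c(-Inf, 0, 1, 2, 3, Inf),
    labels = c("0-", "1", "2", "3", "4+")
)
print(table(magn_class))
```

The result is a factor whose levels are already in order from bright to faint, which also fixes the column order of a subsequent xtabs() call.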

We now need the corresponding expected relative frequencies

magnitutes_expected <- xtabs(p ~ magn_id + magn, data = magnitudes)
magnitutes_row_freq <- margin.table(magnitutes_observed, margin = 1)
magnitutes_expected <- sweep(magnitutes_expected, 1, magnitutes_row_freq, `*`)
magnitutes_expected <- magnitutes_expected / sum(magnitutes_expected)
print(sum(magnitudes$Freq) * margin.table(magnitutes_expected, margin = 2))
#> magn
#>       0-        1        2        3       4+ 
#> 13.55922 14.04429 20.61834 22.82484 25.95332

and then carry out the Chi-Square goodness-of-fit test:

chisq_test_result <- chisq.test(
    x = margin.table(magnitutes_observed, margin = 2),
    p = margin.table(magnitutes_expected, margin = 2)
)

As a result, we obtain the p-value:

# one degree of freedom less, because psi was estimated from the same data
chi2_df <- chisq_test_result$parameter - 1
chi2_pval <- pchisq(chisq_test_result$statistic, df = chi2_df, lower.tail = FALSE)
print(chi2_pval)
#> X-squared 
#> 0.5168011

If we set the significance level at 5 percent, the p-value of 0.5168011 is greater than 0.05. Thus, the assumptions that the magnitude distribution follows the ideal meteor magnitude distribution and that the perception probabilities are correct (i.e., error-free or precisely known) cannot be rejected. However, the converse does not follow: failing to reject the assumptions does not prove them correct. The total count of meteors here is too small for such a conclusion.
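
The p-value can be retraced by hand from the two frequency tables printed above. The sketch below copies those rounded values into plain vectors (the variable names are chosen for this example) and computes the Pearson statistic with the degrees of freedom reduced by one for the estimated parameter.

```r
# Observed and expected class frequencies, copied from the output above
observed <- c("0-" = 14.0, "1" = 10.0, "2" = 23.0, "3" = 26.5, "4+" = 23.5)
expected <- c("0-" = 13.55922, "1" = 14.04429, "2" = 20.61834,
              "3" = 22.82484, "4+" = 25.95332)

# Pearson chi-square statistic
chi2 <- sum((observed - expected)^2 / expected)

# number of classes - 1 (normalization) - 1 (estimated parameter psi)
df <- length(observed) - 1 - 1
pval <- pchisq(chi2, df = df, lower.tail = FALSE)
print(c(chi2 = chi2, df = df, pval = pval))
```

Up to the rounding of the copied frequencies, this should reproduce the p-value reported above.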

To verify the p-value, we also graphically represent the Pearson residuals:

chisq_test_residuals <- with(new.env(), {
    chisq_test_residuals <- residuals(chisq_test_result)
    v <- as.vector(chisq_test_residuals)
    names(v) <- names(chisq_test_residuals)
    v
})

plot(
    chisq_test_residuals,
    main = "Residuals of the chi-square goodness-of-fit test",
    xlab = "m",
    ylab = "Residuals",
    ylim = c(-3, 3),
    xaxt = "n"
)
abline(h = 0.0, lwd = 2)
axis(1, at = seq_along(chisq_test_residuals), labels = names(chisq_test_residuals))
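
For reference, a Pearson residual is the difference between the observed and the expected frequency divided by the square root of the expected frequency. The sketch below recomputes the residuals from the rounded class frequencies printed earlier; the vector names are chosen for this example.

```r
# Observed and expected class frequencies, copied from the earlier output
observed <- c("0-" = 14.0, "1" = 10.0, "2" = 23.0, "3" = 26.5, "4+" = 23.5)
expected <- c("0-" = 13.55922, "1" = 14.04429, "2" = 20.61834,
              "3" = 22.82484, "4+" = 25.95332)

# Pearson residuals: (observed - expected) / sqrt(expected)
pearson <- (observed - expected) / sqrt(expected)
print(round(pearson, 2))

# The squared residuals add up to the chi-square statistic
print(sum(pearson^2))
```

All residuals stay well within the range of -2 to 2, which is consistent with the large p-value.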

2026-05-19
