You decide to use a histogram method to estimate the density
Ask Expert

Be Prepared For The Toughest Questions

Practice Problems

You decide to use a histogram method to estimate the density

5. Histogram methods

Consider a dataset {xj} n j=1 where each point x ∈ [0, 1]d . Let f(x) be the true unknown data distribution. You decide to use a histogram method to estimate the density f(x) and divide each dimension into m bins.

(a) Show that for a measurable set S, E[1{x∈S} ] = P(x ∈ S).

(b) Combining the result of the previous question with the Law of Large Numbers, show that the estimated probability of falling in bin i, as given by the histogram method, tends to , the true probability of falling in bin i, as n → ∞. Vi denotes the volume occupied by bin i.

(c) Consider the MNIST dataset with 784 dimensions. We divide each dimension into 2 bins. How many digits (base 10) does the total number of bins have?

(d) Assume the existence of an idealized MNIST classifier based on a histogram of m = 2 bins per dimension. The accuracy of the classifier increases by ε = 5% (starting from 10% and up to a maximum of 100%) each time k = 4 new data points are added to every bin. What is the smallest number of samples the classifier requires to reach an accuracy of 90%?

(e) Assuming a uniform distribution over all bins, what is the probability that a particular bin is empty, as a function of d, m and n?

Note the contrast between (b) and (e): even if for infinitely many datapoints, the histogram will be arbitrarily accurate at estimating the true distribution (b), in practice the number of samples required to even get a single datapoint in every region grows exponentially with d. 

Hint
StatisticsThe distribution of data across a continuous interval, or time period, is shown using a density plot. This graph is a variant of a histogram that plots data using kernel smoothing, which allows for smoother distributions by smoothing out noise. ...

Know the process

Students succeed in their courses by connecting and communicating with
an expert until they receive help on their questions

1
img

Submit Question

Post project within your desired price and deadline.

2
img

Tutor Is Assigned

A quality expert with the ability to solve your project will be assigned.

3
img

Receive Help

Check order history for updates. An email as a notification will be sent.

img
Unable to find what you’re looking for?

Consult our trusted tutors.

Developed by Versioning Solutions.