# Distribution of Numbers in Pascal's Triangle

# Distribution of Numbers in Pascal's Triangle

### This post uses Python code to explore a sentence from the book "Single Digits": Any number in Pascal’s triangle that is not in the outer two layers will appear at least three times, usually four.

Join the DZone community and get the full member experience.

Join For FreeHow to Simplify Apache Kafka. Get eBook.

This post explores a sentence from the book Single Digits:

Any number in Pascal’s triangle that is *not* in the outer two layers will appear at least three times, usually four.

Pascal’s triangle contains the binomial coefficients C(*n*, *r*) where *n* ranges over non-negative numbers and *r* ranges from 0 to *n*. The outer layers are the elements with *r* equal to 0, 1, *n*-1, and *n*.

We’ll write some Python code to explore how often the numbers up to 1,000,000 appear. How many rows of Pascal’s triangle should we compute? The smallest number in row *n* is C(*n*, 2). Now 1,000,000 is between C(1414, 2) and C(1415, 2) so we need row 1414. This means we need *N* = 1415 below because the row numbers start with 0.

I’d like to use a NumPy array for storing Pascal’s triangle. In the process of writing this code, I realized that a NumPy array with `dtype int`

doesn’t contain Python’s arbitrary-sized integers. This makes sense because NumPy is designed for efficient computation, and using a NumPy array to contain huge integers is unnatural. But I’d like to do it anyway, and the to make it happen is to set `dtype`

to `object`

.

```
import numpy as np
from collections import Counter
N = 1415 # Number of rows of Pascal's triangle to compute
Pascal = np.zeros((N, N), dtype=object)
Pascal[0, 0] = 1
Pascal[1,0] = Pascal[1,1] = 1
for n in range(2, N):
for r in range(0, n+1):
Pascal[n, r] = Pascal[n-1, r-1] + Pascal[n-1, r]
c = Counter()
for n in range(4, N):
for r in range(2, n-1):
p = Pascal[n, r]
if p <= 1000000:
c[p] += 1
```

When we run this code, we find that our counter contains 1732 elements. That is, of the numbers up to one million, only 1732 of them appear inside Pascal’s triangle when we disallow the outer two layers. (The second layer contains consecutive integers, so every positive integer appears in Pascal’s triangle. But most integers *only* appear in the second layer.)

When *Single Digits* speaks of “Any number in Pascal’s triangle that is *not* in the outer two layers” this cannot refer to numbers that are not in the outer two layers because *every* natural number appears in the outer two layers. Also, when it says the number “will appear at least three times, usually four” it is referring to the entire triangle, i.e. including the outer two layers. So another way to state the sentence quoted above is as follows.

Define the interior of Pascal’s triangle to be the triangle excluding the outer two layers. Every number *n* in the interior of Pascal’s triangle appears twice more outside of the interior, namely as C(*n*, 1) and C(*n*, *n*-1). Most often *n* appears at least twice in the interior as well.

This means that any number you find in the interior of Pascal’s triangle, interior meaning not in the two outer layers, will appear at least three times in the full triangle, usually more.

Here are the statistics from our code above regarding just the *interior* of Pascal’s triangle:

- One number, 3003, appears six times.
- Six numbers appear four times: 120, 210, 1540, 7140, 11628, and 24310.
- Ten numbers appear only once: 6, 20, 70, 252, 924, 3432, 12870, 48620, 184756, and 705432.
- The large majority of numbers, 1715 out of 1732, appear twice.

12 Best Practices for Modern Data Ingestion. Download White Paper.

Published at DZone with permission of John Cook , DZone MVB. See the original article here.

Opinions expressed by DZone contributors are their own.

## {{ parent.title || parent.header.title}}

## {{ parent.tldr }}

## {{ parent.linkDescription }}

{{ parent.urlSource.name }}