Jupyter, Venus, the Moon and Gibbs Hill weather
# magic
#!pip install astropy
import astropy
astropy.__githash__
I have been thinking a lot about the relative merits of free and non-free software and have started writing a piece about that.
This week the Intercept published a piece on how XKEYSCORE, the NSA's Google on steroids, works under the hood.
Micah Lee, an expert in information security and a Linux enthusiast, was heavily involved in the piece. It is always good when people who really understand the technology are involved like this.
XKEYSCORE is built on the free software LAMP stack: Linux, Apache, MySQL and Perl/Python/PHP. Red Hat provides the Linux.
So, the NSA uses free software for its critical infrastructure. Further, it depends on free software to keep its operations secure.
This revelation fundamentally changes the debate about the relative merits of free and non-free software from a security point of view.
On the one hand it is a ringing endorsement for LAMP: it is secure enough and robust enough for critical infrastructure.
Writing about information security is difficult. Every facet is double-edged, sometimes multi-edged.
Clearly the NSA engineers understand Linux very well. They are using it to develop their systems. They know how to be productive in that environment. In short, free software works.
They themselves work in a very collaborative environment. They have highly intelligent, passionate and ingenious people who love a challenge. And they use free software.
They will understand how Linux works at a pretty deep level and know how to make it run well, how to get the most out of it.
So do Google and a multitude of other organisations doing high performance computing, where shaving 5% off the resources can make a big impact. Mostly though they will be looking to cut things by an order of magnitude, or two.
We are seeing daily how insecure data stored on computers is. Then there is the question of the integrity of that data: how much of it can we really trust?
Individuals have three types of information based on how many people they wish to share that information with:
The first two are just about feasible to handle with today's technology and software.
The third is extremely difficult with today's technology. Witness all the security breaches in the news.
It may not even be a technological problem. Rather, it is a social issue. Or rather it is an immensely complex mix of social issues.
In the context of information on computers there are so many issues it is hard to know where to begin. It looks like a fairly radical re-think is in order.
The good news is there is a lot of excellent work being done in the free software community. There are many very smart people working on some very difficult problems and making good progress.
Meanwhile there are others that are fixing problems in existing systems and helping the users of their software to plug holes too. This is how free software works.
Now in the public debate we are often given stark choices. If X happens the sky will fall in.
These arguments reinforce the mistaken belief that the choices are so stark or indeed so simple.
As an example, there is a thing called homeomorphic encryption. The idea is to take some data and to turn it into something else that has the same sort of structure, but has thrown away some information in the process.
There is a whole cottage industry of people these days who will sell you tools to anonymise data, mostly for privacy reasons.
The data is invaluable for strategic planning, but it is very difficult to get the balance between respecting privacy and the good of the community. And anonymising data is really, really hard to do well.
Especially if others are anonymising related data. And they are.
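As a toy illustration of the idea, the sketch below (the names, fields and salt are all invented for the example) turns a record into something with the same shape but less identifying detail. Even this kind of transformation is easy to undermine once related datasets exist.

import hashlib

SALT = 'keep-this-secret'   # made-up secret for the example

def pseudonymise(record):
    """ Return a record with the same shape but less identifying detail. """
    digest = hashlib.sha256((SALT + record['name']).encode()).hexdigest()
    return {
        'id': digest[:12],                       # stable pseudonym, name dropped
        'age_band': 10 * (record['age'] // 10),  # 37 becomes 30
        'parish': record['parish'],              # kept, but still a quasi-identifier
    }

print(pseudonymise({'name': 'Alice Outerbridge', 'age': 37, 'parish': 'Paget'}))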
I explain homeomorphism in more detail later.
I live in Bermuda. The island is small. There are about 1.5 degrees of separation: everybody knows everybody.
Children learn from a young age how fast the Bermuda grapevine can be. If they are doing something they shouldn't be doing the news invariably gets home to mum before they do.
As a result, Bermudians tend to be very respectful of people's privacy. They have their own forms of homeomorphic encryption in the way they relate stories, often using terms such as ace-boy or ace-girl to protect identities.
Security researchers can learn a lot from the way these people handle information.
Yet another interesting twist in the world is the restoration of more normal relations between Cuba and the US.
I learned the other day that the Cubans have their own version of Ubuntu, called Nova.
Information security currently comes down to a single question: who or what are you going to trust?
For now, I am placing my trust in the free software community. It may not be perfect, but by my judgement it is by far and away the best option right now.
There is a thing in mathematics called homeomorphism. I expect the term turns up in many other areas too.
In mathematics it means some sort of transformation of an object that leaves certain properties unchanged. There are different kinds of homeomorphism in different branches of mathematics.
In topology, if the objects are made of rubber that can be deformed, and you can change one object into the other without doing things like cutting the rubber or filling in holes, then you have a homeomorphism.
Topologists are people who think a mug and a doughnut are the same thing.
For those interested in some mathematics behind information, Shannon's Theorem is a good place to start.
Work in Progress -- starting to add commentary and tidy up
I connected a BMP180 temperature and pressure sensor to a raspberry pi and have it running in my study.
I have been using this notebook to look at the data as it is generated.
The code uses the Adafruit python library to extract data from the sensor.
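For reference, a minimal sketch of the sort of logging loop involved is below. It assumes the Adafruit_Python_BMP library (its BMP085 driver also handles the BMP180) and a light.csv that already has a header row matching the columns used later in this notebook; the column order and the one minute polling interval are my assumptions, not the actual script.

import csv
import time
from datetime import datetime

import Adafruit_BMP.BMP085 as BMP085

sensor = BMP085.BMP085()

with open('light.csv', 'a') as out:
    writer = csv.writer(out)
    while True:
        writer.writerow([
            datetime.now().isoformat(),       # date
            sensor.read_temperature(),        # degrees C
            sensor.read_pressure(),           # Pascals
            sensor.read_altitude(),           # metres
            sensor.read_sealevel_pressure(),  # Pascals
        ])
        out.flush()
        time.sleep(60)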
I find plotting the data is a good way to take an initial look at it.
So, time for some pandas and matplotlib.
# Tell matplotlib to plot in line
%matplotlib inline
# import pandas
import pandas
# seaborn magically adds a layer of goodness on top of Matplotlib
# mostly this is just changing matplotlib defaults, but it does also
# provide some higher level plotting methods.
import seaborn
# Tell seaborn to set things up
seaborn.set()
# just check where I am
!pwd
infile = '../files/light.csv'
!scp 192.168.0.133:Adafruit_Python_BMP/light.csv .
!mv light.csv ../files
data = pandas.read_csv(infile, index_col='date', parse_dates=['date'])
data.describe()
# Let's look at the temperature data
data.temp.plot()
Looks like we have some bad data here. For the first few days things look OK, though. To start, let's look at the good part of the data.
data[:4500].plot(subplots=True)
That looks good. So for the first 4500 samples the data looks clean.
The pressure and sealevel_pressure plots have the same shape.
The sealevel_pressure is just the pressure recording adjusted for altitude.
Actually, since I am not telling the software what my altitude is, the two series are effectively the same.
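For what it is worth, the usual adjustment (the one in the BMP085/BMP180 datasheet) looks like the sketch below; with the default altitude of zero it returns the pressure unchanged, which is why the two curves match.

def sealevel_pressure(pressure_pa, altitude_m=0.0):
    """ Standard barometric adjustment; with altitude 0 the pressure is unchanged. """
    return pressure_pa / (1.0 - altitude_m / 44330.0) ** 5.255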
It is a bit of a mystery what is causing the bad data after this.
One possibility is I have a separate process that is talking to the sensor that I am running in a console just so I can see the current figures.
I am running this with a linux watch command. I used the default parameters and it is running every 2 seconds.
I am wondering if the sensor code, or the hardware itself, has bugs when the code polls the sensor whilst it is already being probed.
I am now (11am BDA time July 3rd) running the monitor script with watch -n 600 so it only polls every 10 minutes. Will see if that improves things.
So, let's see if we can filter out the bad data.
data.temp.plot()
# All the good temperature readings appear to be in the 25C - 32C range,
# so let's filter out anything far outside that range.
data.temp[(data.temp < 50.0) & (data.temp > 15.0)].plot()
That looks good. You can see eight days of temperatures rising through the day and then falling at night. There is only a couple of degrees' difference here in Bermuda at present.
On the third day, where there is a dip in temperature, I believe there was a thunderstorm or two, which cooled things off temporarily.
I really need to get a humidity sensor working to go with this.
Now let's see if we can spot the outliers and filter them out.
def spot_outliers(series):
    """ Compares the change in value in consecutive samples to the standard deviation.

    If the change is bigger than that, assume it is an outlier.

    Note that there will be two bad deltas, since the sample after the
    bad one will be bad too.
    """
    delta = series - series.shift()
    return delta.abs() > series.std()
outliers = spot_outliers(data)
# Plot temperature
data[~outliers].temp.plot()
data[~outliers].altitude.plot()
data[~outliers].plot(subplots=True)
data[~outliers].sealevel_pressure.plot()
def smooth(data, thresh=None):
    """ Zero out implausibly large jumps between consecutive samples. """
    if thresh is None:
        thresh = data.std()
    delta = data - data.shift()
    # keep the deltas that look sane, treat the rest as no change
    good = delta.abs() < thresh
    print(delta[good].describe())
    return delta.where(good, 0.0)
smooth(data).temp.cumsum().plot()
smooth(data).describe()
start = data[['temp', 'altitude']].iloc[0]
(smooth(data, 5.0).cumsum()[['temp', 'altitude']] + start).plot(subplots=True)
Bingo! We have clean plots. Of course, the irony is that I also seem to have found the cause of the bad data I was getting: don't have two processes querying these sensors at the same time, at least not with the current software. So the recent data no longer needs this smoothing.
So the daily rise and fall of temperature is pretty clear. There is only a 2C spread most days.
The pressure plot is more interesting. Over the last week or so it has been generally high, but there is an interesting wave feature.
The other day I was at the Bermuda Weather Service and mentioned this to Ian Currie, who immediately pointed out that air pressure is tidal.
So, my next plan is to dig out scikit-learn and some lunar data, maybe using astropy and see if we can fit a model to the pressure data for the tidal component.
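A rough sketch of what that might look like is below. It assumes the `data` DataFrame from above, uses astropy (`get_moon`, or `get_body('moon', ...)` in newer versions) to compute the Moon's altitude over Bermuda, and asks how much of the pressure variation a single linear term in that altitude can explain. The coordinates are approximate and the model choice is just a starting point.

import numpy
from astropy import units as u
from astropy.coordinates import AltAz, EarthLocation, get_moon
from astropy.time import Time
from sklearn.linear_model import LinearRegression

# approximate location of the sensor
bermuda = EarthLocation.from_geodetic(lon=-64.8 * u.deg, lat=32.3 * u.deg)

times = Time(data.index.to_pydatetime())
moon = get_moon(times).transform_to(AltAz(obstime=times, location=bermuda))

X = numpy.column_stack([moon.alt.deg])
y = data.sealevel_pressure.values
model = LinearRegression().fit(X, y)
print(model.score(X, y))   # fraction of variance explained by the Moon's altitude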
I have been thinking much about information security over the last few months.
Over the years I have thought about the relative merits of free software versus non-free.
Free as in freedom, or logiciel libre if you prefer. Free software comes with important freedoms. You are free to examine how it works, make changes and experiment.
Free software is everywhere, for example, `openssl`_ is used in many different operating systems.
Even when you are using non-free, you are almost surely using lots of free software at the same time. Depending on how that software is licensed you might be able to find just which software is used.
I find one key differentiation in licenses is in the restrictions they choose to put on the licensees.
So the most permissive licenses tend to be along the lines of, "here is some software, do what you like with it, don't sue me".
Others might insist on attribution (for example if you modify it and pass the resulting application onto somebody else) and have things to say about trademarks etc.
You can find broader discussion of licenses at the following links:
The key clause of the `General Public License`_ is that it insists you pass on the same rights you received to anyone you pass the software on to.
The idea is that recipients should have all they need to explore, modify and experiment with the code.
Things are complicated. On many levels the major effects apply regardless of the license.
Non-GPL licenses have been adopted more readily by proprietary vendors; for example, `Apple's OS is derived from BSD Unix`_.
Python has a BSD-style license. The Python project seems to want to make the language available to anyone who is interested in it. The license does say that if you create a derivative work you need to include a brief summary of the changes made to Python.
The license begins with some history of Python, and some background on `Guido van Rossum`_.
The GPL has also been used as a way to encourage users to license the product under a more permissive, commercial license instead. One notable example was `MySQL`_, which was licensed under the GPL, but that dual-licensing model enabled the company doing the bulk of the development and driving the project to earn license income from those who appreciated the support.
It allowed other businesses to build a business on top of MySQL, but with their own custom adaptations. Often these adaptations would eventually end up in the project itself. At some point it is better to share the maintenance burden of a new feature. Often that point is now.
Python makes itself accessible and hence it has become ubiquitous.
The main effect of licenses is that the non-GPL software tends to be more widely adopted by commercial organisations. The main reason is it allows them to produce derived works and not have to distribute their own customisations and improvements.
However, for smaller commercial organisations it is generally more effective to work with the project itself and share their work.
Keeping your fixes proprietary comes with costs as well as benefits. A number of questions you need to ask are:
- provided tools others can use freely.
- supports free experimentation, gives a world of ideas which you can explore with others
- created tools to help with collaborative working
- found ways to collaborate across the internet
- enabled local people to gain skills needed in their environment
- runs on old hardware
- it is a significant voice in the debate on information security and privacy. It will very likely play an important role.
Since people have access to the software and are free to experiment and explore how it works they can look for potential security holes, report the problems and perhaps fix them if they have the skills.
Of course, the bad guys also have this advantage over non-free software.
When you work with free software you are showing the world your code. This takes a certain amount of courage. Anyone who has taken part in a code review knows that showing your code to others is baring your soul.
Most effective code shops have some form of code review in their development process. Two pairs of eyes are always good. It helps share knowledge, educate both parties and create better software.
Once developers get comfortable with sharing with their peers, sharing with the wider world becomes less daunting.
Positive feedback can be very helpful here: you get an immediate benefit for sharing.
Many software engineers (and I include myself in this) are quite insecure about their code. Most programmers are `sort of average`_, but knowledgeable enough to know that with more time and research their code could be better.
Regardless, code that is shared openly is likely to be of a `higher quality`_, depending on the metric you choose to use to measure quality. The authors of the code will likely care about their reputation in the free software community and hence take care to share quality work, or at least identify the code as a quick hack or whatever.
With non-free software you are working with a black box. You get to choose inputs and observe outputs. If you are lucky you can learn something about how the code works, but it is much harder than with free software.
If you have an executable there are tools that will allow you to take the binary code and create human readable assembly code. This is generally missing comments and variable names. It is a low level description of the code, closer to the final op codes that a computer runs.
However, for those with skill and experience, reverse engineering is a powerful technique.
Sometimes the license will explicitly say you must not reverse engineer the code. Of course, bad guys will not necessarily obey the license.
However, security researchers will often decide not to break the license. The result is that only the bad guys are looking closely at the code for vulnerabilities.
This is not a place you want to be.
This week I spent two half days trying to teach about free software with raspberry pi's.
The students were all local, in full time further education in IT fields and hoping to have a career in IT.
There were 4 students, a mixed group. Fortunately for me, a previous graduate of this summer programme came along to help out.
The first session was a lesson in the problems of working with tech in education. I had made a visit to the room for the training the previous week.
I was supplying raspberry pi's and SD cards, but we needed monitors, keyboards, mice, HDMI cables and wired network connections.
Now we were only able to find monitors with DVI ports, not HDMI and only two monitors. Fortunately, I had DVI to HDMI adaptors and we managed to cobble together the remaining bits and pieces.
I'll skip the problems we had connecting to the network and perhaps cover that in a future post on security and other matters.
Free software was mostly new to the students. Where to begin? I wanted to show them linux through the command line. I wanted them to start to develop a better understanding of how a computer actually works, one of the goals the raspberry pi project shares.
My helper was fantastic. Since there were two pi workstations set up, he worked with one pair of students and me with the other.
One goal on the first day was to introduce the students to version control using git on the command line.
Now we soon hit the editor problem. There is always the dilemma between showing powerful tools with a steep learning curve and simple, quick to learn tools in this sort of training.
I wanted to give the students a glimpse of emacs, in part because it is a classic free software tool.
I first encountered emacs around 1985 when attending an Introduction to Unix course at a local technology college in the UK. The course was a couple of days and they taught us some simple C-shell and an introduction to Unix systems.
Since an editor was needed for the examples, they showed us emacs. I recall writing a review to the effect that whilst emacs seemed to be super powerful, it took a disproportionate amount of the time for the course.
My next experience with emacs was around 1989. My workplace had acquired shiny new Sun workstations. Running Unix. So the first thing I needed was an editor. A colleague explained there were two practical choices: emacs and vi. Emacs had a vi mode, so basically that sealed it. Emacs it was.
This time I was going to be using it to write code. An investment of a few hours learning how to use it well seemed worthwhile, so I read the tutorial. Soon I was hooked. This thing was so much more powerful than anything I had used before.
I am still using emacs, some 25 years later. I've used it to edit code in fortran, C, perl, tcl, python, lisp and who knows what else. I've played tetris, read email, browsed newsgroups, read twitter, run ipython notebooks, used git, read man pages and who knows what else.
For the training though, I probably should have just pointed the students at this raspberry pi page on editors.
By the end of the first session the students had created a git repository and were able to make changes to files, stage the changes and commit them to the repository.
I had a bit of a re-think after the first session. We decided to bring in some more equipment to give us more options in the room.
The students are on a 3 month programme, involving internships in local firms, with 1 week per month in training.
They also have to do some sort of project for the course. One idea they are considering is to create a website which provides information on the public transport on the island.
I decided to structure the afternoon around how they might go about this if they wanted to run it as a free software project.
This gave an opportunity to introduce the students to github and build on the introduction to git that we had started in the previous session.
The bus application is challenging here in Bermuda as much of the data needed for the application does not appear to be available in machine readable form.
For most free software bus applications, having your schedule and route data in the General Transit Feed Specification format allows you to take advantage of a lot of work done in other jurisdictions.
The good news for Bermuda is that its bus and ferry network is small, so even if this data has to be entered by hand it should not take too long. Further, the students could always concentrate on one or two key routes while they iron out the glitches.
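To give a flavour of why the GTFS route is attractive: a feed is just a zip archive of CSV text files, so standard tools can read it directly. The sketch below assumes a hypothetical bermuda-gtfs.zip; no such feed exists yet.

import zipfile

import pandas

with zipfile.ZipFile('bermuda-gtfs.zip') as gtfs:
    stops = pandas.read_csv(gtfs.open('stops.txt'))

# stop_id, stop_name, stop_lat and stop_lon are standard GTFS fields
print(stops[['stop_id', 'stop_name', 'stop_lat', 'stop_lon']].head())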
I had hoped to introduce a little python programming, but this will have to come in later sessions.
In the meanwhile, this advanced git talk from this year's PyCon may be helpful to get a better understanding of how git actually works and introduce some more advanced concepts.
The PyCon videos are all in the PyCon 2015 channel on youtube. I also recommend Jacob Kaplan-Moss's keynote to anyone unsure about whether they have the skills to be a programmer.
I am looking forward to being involved with these students over the summer. Hopefully, they are about to start doing some great things with free software.
Today started with news of a big SNAFU by a Malaysian telco. Part 2 of this excellent history of internet security explains how this giant hole in the internet has been there for a long time and been the cause of some spectacular breakages in the past.
As the article explains, it is really a consequence of the design philosophy of the internet. The aim was to create a robust network that could self-heal. The focus was on creating a system that would allow communication after a catastrophic event such as a nuclear war.
The Border Gateway Protocol controls routing of packets through the internet. If a router says it offers the best route to a particular host, the BGP is OK with that. BGP started life as some scribblings on three napkins over lunch. Now it is fundamental to the working of the entire internet.
Now, if you think about the scenario for which it was designed, back in the 1980's this is not such a bad feature: a nuclear war has destroyed lots of infrastructure, so if a node in the internet says, "Hey, I can help you out", why wouldn't you give it a go?
The problem is that most of the internet traffic is unencrypted, so if a malicious party pretends it is the best route to google.com it gets all the traffic, in clear. Not so good. And it stops packets from going where they are intended at the same time.
The first part of the history of internet security explains how the design in effect delegated responsibility to the end points on the internet. In short, every person connected to the internet is responsible for keeping it secure. Now this worked pretty well in the days when you needed an expensive computer to connect and the people running that computer were probably also doing some of the coding that keeps everything running.
But now the whole world is online, the trust model is somewhat broken. In fact, it is remarkable how well the whole thing does work. We should not lose sight of this: the system works pretty well at times. In fact, it works wonderously well for data that you do not mind sharing with everyone. If you have secrets, then it starts to get more problematic.
We hear stories such as this cyber espionage nightmare. Let's assume the account is accurate; it should be noted that this whole area of computer security is full of smoke, mirrors and snake oil salespeople, which further complicates assessing the real risk out there.
At this point, it is quite clear that most corporations and even governments are fundamentally incapable of protecting data.
Further, much sensitive data has already leaked out. Many organisations are unaware how much data has been leaked. The OPM apparently discovered their breach when a marketing team was giving a demo of their intrusion detection software.
Like the cold war that gave rise to the internet itself, there is an arms race going on. Unlike the cold war, though, once you have lost your secrets they are gone for good.
Now in the short term, the situation is not good. Many people's secrets will be leaking out to a wider audience.
Much of the damage is mitigated by the fact that most internet users are not malicious. For example, when Sony had its email archive leaked and posted online, most people, conscious of how they would feel about others trawling through their inboxes, left it well alone.
But if the party that acquires the secrets has malicious intent, then damage will be done. The cyber espionage nightmare article talks about economic espionage and the stealing of corporate secrets.
Holes in the internet will undoubtedly be plugged. Widespread use of encryption would seem to be desirable. However, at some point people need to read the actual data; it will be decrypted, and so you have to ensure that only happens on secure systems.
Such things are pretty hard to come by. Securing computers against the most determined attackers is extremely difficult with today's tools.
If your secrets are not too valuable to others then you may be OK. Further, if you are a bit more secure than other equally interesting targets you may be OK.
One solution that will always work is not to store secrets on your computers. How much of your data is really secret? Might you be better off sharing your precious intellectual property with everyone?
It turns out large numbers of people are doing just that. With free, open source software, open data and open scientific research. Rather than burdening yourself with having to keep all this information secret, work in the open. Collaborate with others, together build a better mousetrap, a better society for everyone.
The renaissance was very much driven by the sharing of knowledge generated by the invention of the printing press. The internet takes this to a whole new level. Humanity is sharing ideas like never before. New inventions come when people put together ideas from different fields.
The challenge today is not keeping your own ideas secret, rather it is keeping pace with new developments driven by the open sharing on the internet.
The internet was designed for sharing information. If you are using it to keep secrets, it probably won't work out so well for you.
The company in the cyber espionage nightmare may well have been better off sharing its knowledge and focussing on being the best in its field, benefitting from contributions from others and from not having to waste valuable resources protecting secrets it cannot hope to keep.
If you cannot compete with others who have to ship their products half way round the world to beat you then maybe you are not good enough at your business.
I am going to be doing an introduction to free software and linux for this year's Technology Leadership Forum students.
The plan is to have the students use raspberry pi's to learn about the linux platform.
I have a bunch of the new raspberry pi 2's and have been experimenting with different linux distributions on these pi's.
I was going to write up my experiences, but Swapnil Bhartiya has kindly blogged about his own experiences with Arch, Raspbian and Ubuntu Snappy Core. His conclusions were similar to my own.
I have been using Arch Linux a little of late and like many things about Arch and The Arch Way.
First of all, the Arch wiki has excellent documentation. This is critical to its success, since it does many things a little differently to the larger linux distributions.
An initial Arch install will not install much beyond the bare essentials to get you up and running. This does mean it can take a little while to get a new system just how you want it, but has the advantage you do not end up with hundreds of packages installed which you have little idea what they do. For the security conscious this is a definite plus.
One feature I love is that there are no dev packages. Anyone who has tried using any of the main linux distributions and is in the habit of compiling code on those systems will have run into the situation where code fails to build due to missing C header files.
In the major distributions these header files are in separate dev packages. The philosophy is that most people are not compiling code on these machines so do not need the header files. This choice is fine until a new user decides to try compiling some code and then is hit by the missing header file issue. Just another obstacle put in the way of potential new developers.
In contrast, Arch argues that these header files are generally tiny and including them in the main package adds little overhead and saves a lot of time for anyone doing development. It would be good if more distros made this switch.
Using Arch will present some challenges to a new user, but given the excellent state of the documentation it is also an excellent way to gain a thorough understanding of how everything works.
Raspbian appears to be the most widely used distribution on the raspberry pi. It is based on Debian and has over 35,000 packages available.
Since I have mostly used Debian based distributions this seems a good place to start.
Trying different distros can get a little time consuming, between downloading images and copying them onto SD cards. Further, different images are needed for the older pi's and the pi 2.
The simplest way to get a Raspbian system up and running is to download the Raspbian image from raspberrypi.org.
To install, just copy it to the SD card device using dd:
dd if=raspbian_image_you_downloaded.img of=/dev/SDCARD
Finding the device for your SD card can be tricky. lsblk shows you all the block devices and with luck your SD card will be there.
On my Ubuntu system it is /dev/mmcblk0, on an Arch machine it showed up as /dev/sdb. You can usually figure things out using the SIZE of the device.
Note also that you want the block with TYPE disk, not any of the partitions it might have.
Another approach is to use this Raspbian installer. One clear advantage is that it is a small download, a mere 11MB and a small copy onto your SD card.
You then just plug it into the pi, turn on the pi with a wired network connection and the install happens by magic. It takes 20 minutes or so, with a reasonable internet connection.
Other advantages include:
The down side is that the base install comes without a GUI environment. Depending on what you intend to do with the pi this may not be a problem.
I am going to experiment with customising the installer to see if I can get it to install lxde, emacs, git and some other goodies I like to have around on my systems.
I cloned the git repository for the raspbian net installer:
git clone git@github.com:debian-pi/raspbian-ua-netinst.git
I then added an installer-config.txt to specify some extra packages to install.
I then followed the instructions in BUILD.md to rebuild the image and installed from there.
This did not go as well as I had hoped, since although the extra packages got installed their dependencies did not, at least that is what I think happened.
It also took me a couple of goes; installer-config.txt needs to be given execute permission, e.g.:
chmod 755 installer-config.txt
to make this work.
I decided to try using post-install.txt instead. By the time this runs the install is pretty much complete.
My first attempt with this was just to add the following apt-get call:
apt-get install -y git emacs aptitude xserver-xorg-video-fbdev lxde curl htop nmap
But this failed to do anything. Time to track down the actual install script and see what is going on.
The actual install script is in scripts/etc/init.d/rcS
Now I understand. The installer basically boots a minimal linux kernel and then uses a single init script, run during the boot to do the install.
Reading that script it becomes clear what I need to do. The new operating system is actually mounted on /rootfs and I need to use the chroot command to make sure apt-get runs with that as the root filesystem. So I ended up with post-install.txt looking like this:
#!/bin/bash
# install some extra goodies. Do it here rather than in packages to
# pull in dependencies
chroot /rootfs /usr/bin/apt-get install -y \
    git emacs aptitude xserver-xorg-video-fbdev lxde curl htop nmap
Bingo! It works, modulo having to hit enter three times to say OK to some dialogs that the lxde desktop environment displays.
One other trick I used in all this was mounting the SD card's first partition after copying the installer onto it. This allows me to copy over a new installer-config.txt or post-install.txt without having to rebuild the full image.
Overall, I am liking this installer.
The PyData, Dallas videos are now up on youtube under PyData TV.
Lots of interesting stuff to view there. So far, I have watched Luis Miguel Sanchez talk about modelling insurance linked securities (ILS) all in Jupyter notebooks. Shows how things that used to take days and weeks can now be done in a matter of hours.
I am also enjoying Peter Wang talking about the state of the Py.
Gustave Dore Ancient Mariner Illustration, licensed under Public Domain via Wikimedia Commons.
I started looking at this data after being inspired by Chris Waigl's PyCon talk on Satellite mapping for everyone. Chris gave an introduction to some of the python tools you can use to look at this data.
She mentioned some websites where you can find data from various satellite missions and showed how to work with the data and just what sort of data is available.
There really is a wonderful array of data available. The process of exploring all this can be time consuming, but fun. There is a lot of fascinating work being done.
I was hoping to find images in and around the time of the hurricanes. With high enough resolution I believe it should be possible to use image processing software to help with damage surveys. For example, it should be possible to spot blue tarpaulins placed over damaged roofs, or indeed the roof damage itself.
I picked Landsat, pretty much based on Chris's talk. I found two days, either side of the hurricanes that hit Bermuda last October, for which images were available.
So far so good. But to get the higher resolution data you have to register and obtain a key to gain access to the downloads. These are large, around 1GB per image, so it is reasonable for whichever agency is supplying the images to know a little about those downloading the data, and in particular to be able to contact them if they are putting an unreasonable load on the servers.
Now for Bermuda we only really need about 1% of the data which covers the few square miles around the island. This is much more manageable, so it feels it would be good to be able to host the data locally. This is something I expect I will return to.
There is still much to do here. Chris noted that you will learn coordinate systems. Up to now I have managed to avoid this, just slicing the numpy arrays that rasterio gives me as I read the images.
I need to learn how to pull out a window from one of these images by specifying the lat/lon box defining the area to extract. Better still, I could do with a number of pre-defined boxes that pull out interesting areas of Bermuda, for example each Parish.
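A sketch of how that windowing might work with the current rasterio API is below (the exploratory code later on just slices the numpy arrays instead). The bounding box for Bermuda is approximate and purely illustrative.

import rasterio
from rasterio.crs import CRS
from rasterio.warp import transform as warp_coords
from rasterio.windows import from_bounds

def read_latlon_box(path, lon_min, lat_min, lon_max, lat_max, band=1):
    """ Read only the pixels covering a lon/lat box from a GeoTIFF. """
    with rasterio.open(path) as src:
        # Landsat scenes use a projected CRS (UTM), so convert the lon/lat
        # corners into the dataset's own coordinates first
        xs, ys = warp_coords(CRS.from_epsg(4326), src.crs,
                             [lon_min, lon_max], [lat_min, lat_max])
        window = from_bounds(min(xs), min(ys), max(xs), max(ys),
                             transform=src.transform)
        return src.read(band, window=window)

# a rough bounding box around Bermuda (west, south, east, north)
bermuda = read_latlon_box('../data/LC80060382014275LGN00_B2.TIF',
                          -64.95, 32.20, -64.60, 32.45)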
Most satellite data is collected by expensive national and international space missions. Many of the missions are aimed at creating a global resource, but generally focussed on the nations that funded the missions.
The Global Precipitation Measurement mission has a number of data sets available. These are typically at 0.1 degree resolution, which corresponds to about 7 miles on the ground. The temporal resolution is good: a new image is available at 30 minute intervals. Further, the project aims to:
intercalibrate, merge, and interpolate "all" satellite microwave precipitation estimates, together with microwave-calibrated infrared (IR) satellite estimates, precipitation gauge analyses, and potentially other precipitation estimators at fine time and space scales for the TRMM and GPM eras over the entire globe.
For Bermuda, the spatial resolution is not quite enough to do a detailed analysis, but it is very useful to understand the severity of storms hitting the island.
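A quick back-of-the-envelope check on that claim:

# one degree of latitude is roughly 111 km, so a 0.1 degree grid cell is
# about 11 km (roughly 7 miles) across -- wider than the island in most places
km_per_degree_lat = 111.32
print(0.1 * km_per_degree_lat)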
Another good source of data is weather station data. It is possible to build a DIY weather station for $200-300. The project in the link had some special constraints: it was intended to create a weather station that would show the conditions at a lake a 2 hour drive from the person that built it, so it needed to be robust against system glitches.
There are a number of weather stations here in Bermuda that are connected to the Weather Underground network of stations.
Another project is Open Weather Map. This provides an API that you can use to connect your weather station to the network.
For risk modelling purposes we ideally need historical data for the times when the larger storms have hit the island. The sites mentioned above are primarily focussed on weather forecasting, rather than collecting data for subsequent analysis, although they do also do this.
Unfortunately, access to historical data is limited without a paid subscription. The sites have costs to cover, so small charges for access to data is one way to continue to provide the service.
Full access to Open Weather Map historical data costs $2000 per month. If we want to create an environment where interested parties can explore their ideas then removing these cost barriers is an important step to take.
If Bermuda had a network of 100-200 weather stations it would open lots of powerful modelling opportunities. For example, machine learning could be used to try to tease out the relationship between the winds recorded at each station and parameters such as height, distance from the coast, local topography, land use and whatever other parameters are available.
If such a model can be fitted to the data it can then be used to estimate windspeed for any point on the island. In this way we can create a detailed windfield model for Bermuda.
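A minimal sketch of that kind of model with scikit-learn is below. The stations.csv file and every column name in it are hypothetical, just to show the shape of the approach.

import pandas
from sklearn.ensemble import RandomForestRegressor
from sklearn.model_selection import cross_val_score

stations = pandas.read_csv('stations.csv')    # one row per weather station
features = ['height_m', 'distance_from_coast_m', 'slope', 'land_use_code']
X = stations[features]
y = stations['peak_gust_mph']                 # observed peak gust in one storm

model = RandomForestRegressor(n_estimators=200, random_state=0)
print(cross_val_score(model, X, y, cv=5))     # does it generalise?
model.fit(X, y)

# the fitted model can now estimate the peak gust anywhere on the island
# where the same features can be computed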
Furthermore, most of the tools needed to do this are already available as open source projects. The pieces just need to be glued together.
Finally, any work done here in Bermuda can easily be generalised and applied to other similar jurisdictions.
If we also have a detailed, post event, damage survey we can also use the same machine learning techniques to develop damage models relating the hazard at each location to the damage it creates.
There are good open source tools, such as scikit-learn and scikit-image that can be used for this modelling.
Open Street Map is an open mapping project that has been running for many years:
OpenStreetMap is built by a community of mappers that contribute and maintain data about roads, trails, cafes, railway stations, and much more, all over the world
The project has a number of related projects centred around the core mapping project.
The Humanitarian OpenStreetMap Team works on mapping damage to help with relief work following natural disasters, such as the recent Nepal earthquakes.
Whilst this work is focussed on disaster relief in the immediate aftermath of a disaster it is producing valuable data which can be used to better understand damage. It could be a key input into new models that can be used to explore mitigation measures for future events.
With the world facing unprecedented challenges, such as climate change and increased earthquake risk, due both to human activities such as fracking and to the melting ice caps changing the stresses on tectonic plates, there is a humanitarian need to be able to model and explore the potential impacts on delicate eco-systems such as small island communities.
At PyCon in Montreal (https://us.pycon.org/2015/) Chris Waigl gave a talk about satellite mapping and some of the python tools that help with this.
Following the talk I decided to take a look to see what satellite data is available around the time of the hurricanes Fay and Gonzalo, back in October 2014.
The hope was to be able to find suitable before and after images at a high enough resolution to use image processing software to help with damage analysis.
Chris's talk is on youtube (along with all the other PyCon talks) and embedded below.
from IPython import display
# Chris Waigl, Satellite mapping for everyone.
display.YouTubeVideo('MCHpt1FvblI')
A little googling turned up this gem from NASA's Tropical Rainfall Measuring Mission.
Image Credit: NASA/SSAI, Hal Pierce
This is a seven day animation, covering the period of Fay and Gonzalo.
Assuming rainfall is a good proxy for storm intensity, you can see how Fay intensified as it reached the island and how Gonzalo followed a very similar path, just six days later.
The key question with respect to Bermuda is whether this sort of data is available at higher resolution.
The article does mention that
Global Precipitation Measurement (GPM) mission product in late 2014 will supersede the TRMM project.
The Nasa GPM page has some wonderful animations of the sort of thing that is possible with GPM.
# 3-D animation of a typhoon from the GPM project
display.YouTubeVideo('kDlTZxejlbI')
A major challenge with satellite data is finding just what images are available.
Landsat has a well documented site created by the USGS
However it is still time consuming to see what is available.
Downloads can be large, roughly 1GB per satellite image. These images generally contain multiple layers for different parts of the spectrum.
To download the larger files you need to register and get an API key.
Once registered I downloaded a couple of images, either side of the October storms.
Below are my attempts to extract and plot the data.
# let's start with matplotlib
%matplotlib inline
from matplotlib import pyplot
# Chris recommended the rasterio library
import rasterio
infile = '../data/LC80060382014275LGN00_B2.TIF'
# This is pretty simple, just open the TIFF file and you have
# an object that can tell you all sorts of things about the image
data = rasterio.open(infile)
data.width, data.height
# take a look at the meta data
data.meta
# read the bands in the file, there will be as many bands as
# the count above
bands = data.read()
# take a look at the data -- numpy arrays with 16 bit values
bands
# so we have a 3D array, first dimension is the band
bands[0].shape
img = bands[0]
# just take every 10th pixel for now -- imshow does not handle
# large images well.
img = img[::10, ::10]
img.shape
# now plot the thing.
pyplot.imshow(img)
So we have succeeded in downloading and plotting one of these bands.
Now time to play spot Bermuda. First impressions are this particular data is likely not high enough resolution to be useful.
A second thing to note is that the NASA sites are, understandably, quite US-centric. To do comprehensive studies of satellite data for Bermuda it looks like it will be worthwhile to create local mirrors of the key data.
In particular, whilst some of these images are quite large, the part covering Bermuda will generally be much more manageable.
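One way to do that, sketched below with the current rasterio API, is to crop each downloaded scene to a window around Bermuda and write the result out as a much smaller GeoTIFF. The window offsets and output path are illustrative.

import rasterio
from rasterio.windows import Window

window = Window(col_off=4000, row_off=1300, width=1200, height=1000)

with rasterio.open('../data/LC80060382014275LGN00_B2.TIF') as src:
    cropped = src.read(1, window=window)
    profile = src.profile
    profile.update(width=window.width, height=window.height,
                   transform=src.window_transform(window))

with rasterio.open('../data/bermuda_B2.TIF', 'w', **profile) as dst:
    dst.write(cropped, 1)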
# put it all together
def plot_image(infile, box=None, axes=None):
    """ Plot the first band of an image, cropped to the given box. """
    if axes is None:
        # no axes passed in, so create a figure with a single set of axes
        fig, axes = pyplot.subplots(figsize=(8, 8))
    if box is None:
        box = 1000, 2200, 3700, 5500
    top, bottom, left, right = box
    data = rasterio.open(infile)
    bands = data.read()
    img = bands[0]
    axes.imshow(img[top:bottom, left:right])
# plotting images either side of the hurricane
fig, axes = pyplot.subplots(1, 2, figsize=(8,8))
#pyplot.subplot(1,2,1)
x = 3
top = 1300
left = 4000
width = 1200
height = 1000
box = (top, top + height, left, left + width)
infile = '../data/LC80060382014275LGN00_B%d.TIF' % x
plot_image(infile, box=box, axes=axes[0])
#pyplot.subplot(1,2,2)
infile = '../data/LC80060382014307LGN00_B%d.TIF' % x
plot_image(infile, box=box, axes=axes[1])