.. loading-data-from-files.rst
   changeset 204:65e5e2362bc9 (parent 199:680a0692529f), author amit,
   Thu, 23 Sep 2010 13:14:31 +0530, "small change in getting_started_with_lists"
   permissions -rw-r--r--
   Annotated revisions:
   144:476ea1730aee  Added rst files for scripts.  Puneeth Chaganti <punchagan@gmail.com>
   192:3a839dca2bdb  reviewed loading_data_from_files  Nishanth <nishanth@fossee.in>
   199:680a0692529f  Incorporated changes from review into loading-data-from-files.rst.  Puneeth Chaganti <punchagan@fossee.in>

========
 Script
========


Welcome to this tutorial on loading data from files.

{{{ Screen shows welcome slide }}}

We often need to plot points obtained from experimental
observations. In this tutorial we shall learn to read data from files
and save it into sequences that can later be used for plotting.


{{{ Show the outline for this tutorial }}}

We shall use the ``loadtxt`` command to load data from files. We will
be looking at how to read a file with multiple columns of data and
load each column of data into a sequence.


{{{ switch back to the terminal }}}

As usual, let us start IPython, using
::

  ipython -pylab


Now, let us begin by reading the file ``primes.txt``, which contains
just a list of primes in a single column, using the ``loadtxt`` command.
The file, in our case, is present at ``/home/fossee/primes.txt``.

{{{ Navigate to the path in the OS, open the file and show it }}}


.. #[punch: do we need a slide for showing the path?]

.. We use the ``cat`` command to see the contents of this file.

.. #[punch: should we show the cat command here? seems like a good place
   to do it] ::

     cat /home/fossee/primes.txt

.. #[Nishanth]: A problem for windows users.
                Should we simply open the file and show them the data
                so that we can be fine with GNU/Linux ;) and windows?


Now let us read this list into the variable ``primes``.
::

  primes = loadtxt('/home/fossee/primes.txt')

``primes`` is now a sequence of the primes that were listed in the file,
``primes.txt``.

We now type ``print(primes)`` to see the sequence printed.

We observe that all of the numbers end with a period. This is
because these numbers are actually read as ``floats``. We shall learn
about them later.

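
To make the float behaviour concrete, here is a minimal, self-contained sketch. It assumes NumPy is installed and imported directly (rather than through ``ipython -pylab``), and it writes a small stand-in file to a temporary directory instead of using ``/home/fossee/primes.txt``. The file contents and the ``dtype=int`` variation are illustrative assumptions, not part of the tutorial's data.

```python
import os
import tempfile

import numpy as np

# Hypothetical stand-in for primes.txt: one prime per line.
tmp_dir = tempfile.mkdtemp()
path = os.path.join(tmp_dir, "primes.txt")
with open(path, "w") as f:
    f.write("2\n3\n5\n7\n11\n")

# By default loadtxt converts every value to a float.
primes = np.loadtxt(path)
print(primes)        # the values are shown with a trailing period
print(primes.dtype)  # float64

# Passing dtype=int asks loadtxt for integers instead.
primes_int = np.loadtxt(path, dtype=int)
print(primes_int)
```

The ``dtype`` argument is optional; when it is left out, as in the tutorial, every value is read as a float.
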

Now, let us use the ``loadtxt`` command to read a file that contains
two columns of data, ``pendulum.txt``. This file contains the length
of the pendulum in the first column and the corresponding time period
in the second.

%%1%% Pause the video here, and use the ``cat`` command to view the
contents of this file and then resume the video.


This is how we look at the contents of the file, ``pendulum.txt``
::

  cat /home/fossee/pendulum.txt

.. #[Nishanth]: The first column is L values and second is T values
                from a simple pendulum experiment.
                Since you are using the variable names later in the
                script.
                Not necessary but can be included also.


Let us now read the data into the variable ``pend``. Again, it is
assumed that the file is in ``/home/fossee/``
::

  pend = loadtxt('/home/fossee/pendulum.txt')

Let us now print the variable ``pend`` and see what's in it.
::

  print(pend)


Notice that ``pend`` is not a simple sequence like ``primes``. It is a
sequence of items, each of which holds the two values from one row of
the data file: the length and the corresponding time period. Let us
use an additional argument of the ``loadtxt`` command, ``unpack=True``,
to read the columns into two separate, simple sequences.
::

  L, T = loadtxt('/home/fossee/pendulum.txt', unpack=True)


.. #[Nishanth]: It has a sequence of items in which each item contains
                two values. first is l and second is t

Let us now print the variables ``L`` and ``T`` to see what they contain.
::

  print(L)
  print(T)

.. #[Nishanth]: Stress on ``unpack=True`` ??

Notice that ``L`` and ``T`` now contain the first and second columns of data
from the data file, ``pendulum.txt``, and they are both simple
sequences. ``unpack=True`` has given us the two columns as two
separate sequences instead of one complex sequence.

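
The difference between the two calls can be checked with a small sketch. It assumes NumPy is imported directly and uses made-up length and time-period values, held in an in-memory ``StringIO`` object, in place of ``pendulum.txt``. The name ``data_text`` is hypothetical.

```python
from io import StringIO

import numpy as np

# Made-up sample in the same layout as pendulum.txt:
# length in the first column, time period in the second.
data_text = "0.1 0.69\n0.2 0.90\n0.3 1.19\n"

# Without unpack, the result is one two-dimensional array (rows x columns).
pend = np.loadtxt(StringIO(data_text))
print(pend.shape)  # (3, 2)

# With unpack=True the array is transposed, so each column can be
# assigned to its own name.
L, T = np.loadtxt(StringIO(data_text), unpack=True)
print(L)
print(T)

# The same columns can also be sliced out of the two-dimensional array.
print(np.array_equal(L, pend[:, 0]))  # True
print(np.array_equal(T, pend[:, 1]))  # True
```

Using ``unpack=True`` simply saves the slicing step when each column is wanted under its own name.
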

{{{ show the slide with loadtxt --- other features }}}

In this tutorial, we have learnt the basic use of the ``loadtxt``
command, which is capable of doing a lot more than we have used it for
until now. The following exercise explores one more of its features.

%%2%% Pause the video here, and read the file
``pendulum_semicolon.txt`` which contains the same data as
``pendulum.txt``, but the columns are separated by semi-colons instead
of spaces. Use the IPython help to see how to do this. Once you have
finished, resume the video to look at the solution.


{{{ switch back to the terminal }}}
::

  L, T = loadtxt('/home/fossee/pendulum_semicolon.txt', unpack=True, delimiter=';')

  print(L)

  print(T)

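
A self-contained sketch of the same idea follows. It assumes NumPy is imported directly and substitutes a few made-up, semicolon-separated values held in memory for ``pendulum_semicolon.txt``. The name ``data_text`` is hypothetical.

```python
from io import StringIO

import numpy as np

# Made-up values in the layout of pendulum_semicolon.txt:
# length;time period
data_text = "0.1;0.69\n0.2;0.90\n0.3;1.19\n"

# delimiter=';' tells loadtxt to split each line on semicolons
# rather than on whitespace.
L, T = np.loadtxt(StringIO(data_text), unpack=True, delimiter=";")
print(L)
print(T)
```

In IPython, typing ``loadtxt?`` shows the documentation that lists ``delimiter`` along with the other arguments.
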

This brings us to the end of this tutorial.

{{{ show the summary slide }}}

You should now be able to do the following comfortably.

  + Read data from files containing a single column of data, using the
    ``loadtxt`` command.
  + Read multiple columns of data, separated by spaces or other
    delimiters.

Thank you!