st-scripts: comparison parsing


-:7f8b6a9fb61d
+:fc545d07b0ff
 parsing data is all about reading the data and converting it into a form which
 can be used for computations. In our case, that is numbers.
 We can clearly see that the problem involves reading files and tokenizing.
-Let us learn about tokenizing strings. Let us define a string first. Type::
+Let us learn about tokenizing strings. Let us define a string first. Type
+::
 line = "parse this           string"
-We are now going to split this string on whitespace.::
+We are now going to split this string on whitespace.
+::
 line.split()
 As you can see, we get a list of strings. This means that when split is called
 without any arguments, it splits on whitespace. In simple words, each run of
 consecutive spaces is treated as one big space.
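To make the difference concrete, here is a small sketch comparing split without an argument against split with an explicit single-space separator. The variable names tokens_default and tokens_space are illustrative only and are not part of the script.

```python
line = "parse this           string"

# With no argument, each run of whitespace acts as a single separator.
tokens_default = line.split()

# With an explicit " " separator, every space is a separator on its own,
# so consecutive spaces produce empty strings in the result.
tokens_space = line.split(" ")

print(tokens_default)
print(tokens_space)
```

The second call is rarely what is wanted for free-form text, which is why the argument-less form is used here.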
 split can also split on a string of our choice. This is achieved by passing
-that as an argument. But first let's define a sample record from the file.::
+that as an argument. But first let's define a sample record from the file.
+::
 record = "A;015163;JOSEPH RAJ S;083;042;47;AA;72;244;;;"
 record.split(';')
 We can see that the string is split on ';' and we get each field separately.
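As a quick check of what the split on ';' produces, the sketch below counts the fields of the sample record. Note that the trailing ';;;' yields empty strings, and that the spaces inside the name field are left alone.

```python
record = "A;015163;JOSEPH RAJ S;083;042;47;AA;72;244;;;"

# Splitting on ";" keeps everything between two separators as one field,
# including inner spaces and empty fields.
fields = record.split(";")

print(len(fields))
print(fields[0], fields[2], fields[5])
print(fields[-3:])
```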
 surrounded by whitespace are treated as two different regions. We must find a
 way to remove all the whitespace around a string so that "B" and a "B" with
 surrounding whitespace are treated as the same.
 This is possible by using the =strip= method of strings. Let us define a
-string by typing::
+string by typing
+::
 unstripped = "     B    "
 unstripped.strip()
 We can see that strip removes all the whitespace around the sentence.
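For comparison, Python strings also offer lstrip and rstrip, which strip only one side. The sketch below is an aside and not part of the script; it shows why strip is the right choice for comparing region codes.

```python
unstripped = "     B    "

both = unstripped.strip()    # whitespace removed from both ends
left = unstripped.lstrip()   # whitespace removed from the left end only
right = unstripped.rstrip()  # whitespace removed from the right end only

print(repr(both), repr(left), repr(right))

# Only the fully stripped value compares equal to the bare region code.
print(both == "B", left == "B", right == "B")
```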
 %% 2 %% What happens to the white space inside the sentence when it is stripped?
 {{{ continue from paused state }}}
-Type::
+Type
+::
 a_str = "         white      space            "
 a_str.strip()
 We see that only the whitespace around the sentence is removed and anything
 The splitting and stripping operations are done on a string and the results are
 still strings. Hence the marks that we have are still strings and mathematical
 operations are not possible. We must convert them into integers or floats.
 We shall look at converting strings into floats. We define a float string
-first. Type::
+first. Type
+::
 mark_str = "1.25"
 mark = float(mark_str)
 mark_str
 mark
 %% 3 %% What happens if you do int("1.25")?
 {{{ continue from paused state }}}
 It raises a ValueError since converting a float string into an integer directly is
-not possible. It involves an intermediate step of converting to float.::
+not possible. It involves an intermediate step of converting to float.
+::
 dcml_str = "1.25"
 flt = float(dcml_str)
 flt
 number = int(flt)
 Using =int= it is also possible to convert floats into integers; the fractional
 part is discarded.
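The two-step conversion can be wrapped up as in the following sketch. The helper name to_int is hypothetical and does not appear in the script; it only illustrates that int works on a float value but not on a float string.

```python
def to_int(text):
    """Convert a numeric string such as "1.25" to an int by going through float."""
    return int(float(text))


# Direct conversion of a float string fails with ValueError.
try:
    int("1.25")
    direct_failed = False
except ValueError:
    direct_failed = True

print(direct_failed)
print(to_int("1.25"))
print(to_int("-1.75"))  # int() truncates toward zero rather than rounding
```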
 Now that we have all the machinery required to parse the file, let us solve the
 problem. We first read the file line by line and parse each record. We see if
-the region code is B and store the marks accordingly.::
+the region code is B and store the marks accordingly.
+::
 math_marks_B = [] # an empty list to store the marks
 for line in open("/home/fossee/sslc1.txt"):
     fields = line.split(";")
     if region_code == "B":
         math_marks_B.append(math_mark)
 Now we have all the maths marks of region "B" in the list math_marks_B.
-To get the mean, we just have to sum the marks and divide by the length.::
+To get the mean, we just have to sum the marks and divide by the length.
+::
 math_marks_mean = sum(math_marks_B) / len(math_marks_B)
 math_marks_mean
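Putting the pieces together, the whole computation might look like the sketch below. It runs on a few in-memory records instead of the file so that it is self-contained. Only the first record comes from the text above; the other three are made up for illustration. The field positions (region code in field 0, maths mark in field 5), the helper name mean_math_mark, and the decision to skip non-numeric marks such as "AA" are all assumptions, not something the script confirms.

```python
# Sample data: the first record is from the text, the rest are invented.
records = [
    "A;015163;JOSEPH RAJ S;083;042;47;AA;72;244;;;",
    " B ;015164;SAMPLE NAME ONE;070;055;60;58;61;304;;;",
    "B;015165;SAMPLE NAME TWO;065;050;80;AA;59;254;;;",
    "B;015166;SAMPLE NAME THREE;065;050;AA;62;59;236;;;",
]

REGION_FIELD = 0  # assumed position of the region code
MATH_FIELD = 5    # assumed position of the maths mark


def mean_math_mark(lines, region="B"):
    """Return the mean maths mark for one region, or None if no marks are found."""
    marks = []
    for line in lines:
        fields = line.split(";")
        # Strip so that "B" and " B " count as the same region.
        if fields[REGION_FIELD].strip() != region:
            continue
        try:
            marks.append(float(fields[MATH_FIELD].strip()))
        except ValueError:
            # Assumption: a non-numeric mark such as "AA" is skipped.
            continue
    return sum(marks) / len(marks) if marks else None


print(mean_math_mark(records))
```

To run this against the real data, the list can be replaced by the open file object, since iterating over a file also yields one line at a time.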
 {{{ Show summary slide }}}

changeset 137	fc545d07b0ff
parent 134	543c1cc488ca
child 140	bc023595e167