These tips and tricks are a little bit more advanced than the ones in the previous notebook. I try to update the post every once in a while with the latest version of Python, so it should be roughly up to date.

Table of Contents

Learning your environment
Aliasing
Functional programming
- Zip
- Filter
- Map
- Reduce
Error handling
Testing
Enums
- IntEnums
  - Enums vs IntEnums
Scope
Type Annotations
Python Disassembler
Inspection
Namespace Mangling

Learning your environment

Where am I?

import os
os.getcwd() #get current working directory

'C:\\Users\\Julius\\Google Drive\\JupyterNotebooks\\Blog'

What verison of Python am I using?

import sys
sys.version

'3.8.5 (default, Sep  3 2020, 21:29:08) [MSC v.1916 64 bit (AMD64)]'

Where is my Python interpreter located?

sys.executable

'C:\\Users\\Julius\\anaconda3\\envs\\tf\\python.exe'

What conda environment am I in?

!conda env list

# conda environments:
#
base                  *  C:\Users\Julius\anaconda3
pt                       C:\Users\Julius\anaconda3\envs\pt
tf                       C:\Users\Julius\anaconda3\envs\tf
tf-gpu                   C:\Users\Julius\anaconda3\envs\tf-gpu

Where will Python look for modules?

import sys

print(sys.path)

['C:\\Users\\Julius\\Google Drive\\JupyterNotebooks\\Blog', 'C:\\Users\\Julius\\Documents\\GitHub', 'C:\\Users\\Julius\\Documents\\GitHub\\cv\\src\\py', 'C:\\Users\\Julius\\Documents\\GitHub\\fastai-pythonic', 'C:\\Users\\Julius\\Documents\\GitHub\\facv\\src', 'C:\\Users\\Julius\\Documents\\GitHub\\fastai2', 'C:\\Users\\Julius\\Documents\\GitHub\\ObjectDetection', 'C:\\Users\\Julius\\Documents\\GitHub\\fastcore', 'C:\\Users\\Julius\\Documents\\GitHub\\cv_dataclass\\src', 'C:\\Users\\Julius\\Google Drive\\JupyterNotebooks\\Blog', 'C:\\Users\\Julius\\anaconda3\\envs\\tf\\python38.zip', 'C:\\Users\\Julius\\anaconda3\\envs\\tf\\DLLs', 'C:\\Users\\Julius\\anaconda3\\envs\\tf\\lib', 'C:\\Users\\Julius\\anaconda3\\envs\\tf', '', 'C:\\Users\\Julius\\AppData\\Roaming\\Python\\Python38\\site-packages', 'C:\\Users\\Julius\\anaconda3\\envs\\tf\\lib\\site-packages', 'C:\\Users\\Julius\\anaconda3\\envs\\tf\\lib\\site-packages\\win32', 'C:\\Users\\Julius\\anaconda3\\envs\\tf\\lib\\site-packages\\win32\\lib', 'C:\\Users\\Julius\\anaconda3\\envs\\tf\\lib\\site-packages\\Pythonwin', 'C:\\Users\\Julius\\anaconda3\\envs\\tf\\lib\\site-packages\\IPython\\extensions', 'C:\\Users\\Julius\\.ipython']

Python modules

Where are the site packages held?

import site
site.getsitepackages()

['C:\\Users\\Julius\\anaconda3\\envs\\tf',
 'C:\\Users\\Julius\\anaconda3\\envs\\tf\\lib\\site-packages']

Where is a particular package?

import tensorflow
print(tensorflow.__file__)

C:\Users\Julius\anaconda3\envs\tf\lib\site-packages\tensorflow\__init__.py

import matplotlib.pyplot as plt
print(plt.__file__)

C:\Users\Julius\anaconda3\envs\tf\lib\site-packages\matplotlib\pyplot.py

Aliasing

Python uses references. This can create aliasing problems.

# Here's an example that isn't a problem
old_list = [1,2,3,4]
new_list = old_list
new_list[2]=7
print(old_list)

[1, 2, 7, 4]

old_list has changed without being changed directly. This is because when I did new_list = old_list, it created a reference from the value of old_list to a new variable, new_list. But it did not make a second copy of the value, so they are pointing to the same value. If that value is changed both variables will see the change.

This can be a desired result, but sometimes it isn’t. In those cases you can make a copy of the value instead of just getting a reference to the old value. Do this by setting new_list equal to old_list[:] or list(old_list)

old_list = [1,2,3,4]
new_list = old_list[:]
new_list[2]=7
print(old_list)

[1, 2, 3, 4]

old_list = [1,2,3,4]
new_list = list(old_list)
new_list[2]=7
print(old_list)

[1, 2, 3, 4]

To see this in more detail, you can look at the id of the variable.

old_list = [1,2,3,4]
new_list = old_list
print(id(old_list))
print(id(new_list))
new_list = old_list[:]
# By using a copy, new_list gets a different id")
print(id(new_list))

3179513172992
3179513172992
3179513163712

Functional programming

Zip

a = range(5)
b = range(5,10)
c = zip(a,b)
c = list(c)
print(c)

[(0, 5), (1, 6), (2, 7), (3, 8), (4, 9)]

# Or, you can unzip
ca, cb = zip(*c)

print(ca)
print(cb)

(0, 1, 2, 3, 4)
(5, 6, 7, 8, 9)

You can use this to quickly sort two lists while keeping them in sync.

preds = [0.1, 0.95, 0.11, 0.35, 0.75, 0.8]
y_true = [0, 1, 0, 0, 1, 0]

sorted_preds, sorted_y_true = zip(*sorted(zip(preds, y_true), reverse=True))

print(sorted_preds)
print(sorted_y_true)

(0.95, 0.8, 0.75, 0.35, 0.11, 0.1)
(1, 0, 1, 0, 0, 0)

Filter

Filter is good for, as it sounds, filtering.

list(filter(lambda x: x > 5, range(10)))

[6, 7, 8, 9]

Note that you could also do this with a list comprehension.

[x for x in range(10) if x > 5]

[6, 7, 8, 9]

Map

Map can actually changes the values.

list(map(lambda x : str(x), range(10)))

['0', '1', '2', '3', '4', '5', '6', '7', '8', '9']

And the list comprehensions way.

[str(x) for x in range(10)]

['0', '1', '2', '3', '4', '5', '6', '7', '8', '9']

range, map, zip, and filter are all iterables in Python 3.X, so to see the actual values you can use the list command.

list(map(abs, [-5, -2, 1]))

[5, 2, 1]

Or you can use lambda, the anonymous function.

list(map(lambda x: x**2, range(4)))

[0, 1, 4, 9]

You can also use multiple arguments with lambdas

list(map(lambda x, y: x + y, range(4), [10, 20, 30, 40]))

[10, 21, 32, 43]

Reduce

from functools import reduce

reduce(lambda x,y: x+y, [23, 34, 12, 23])

Error handling

There is a hierarchy of Python Exceptions. It is best practice to use the most specific one that applies to your case, and if none do, to raise a custom Exception. Here are some of the most common Exceptions.

IndexError

a = [1,2,3,4]
try:
    print(a[5])
except IndexError as e:
    print("There is no element #5, so you get an IndexError: ", e)

There is no element #5, so you get an IndexError:  list index out of range

NameError

my_variable = 4
try:
    print(my_veriable)
except NameError:
    print("I have introduced a typo, so the call to variable 'my_veriable' returns an error")

I have introduced a typo, so the call to variable 'my_veriable' returns an error

TypeError

# Trying to use a type in a way it cannot be
print(a[2]) # works fine
try:
    print(a['two']) # returns an error
except TypeError as err:
    print(err)

3
list indices must be integers or slices, not str

SyntaxError

Syntax errors can be a little different. That’s because the syntax is wrong, which prevents the try/except block from being set up.

try:
    print("Syntax often result from missing parentheses"
except SyntaxError:
    print("This is never printed")

  File "C:\Users\Julius\AppData\Local\Temp/ipykernel_29776/3723339239.py", line 3
    except SyntaxError:
    ^
SyntaxError: invalid syntax

There is a way around this using eval. Let’s say you’re trying to use a try/except in the following:

try:
    [2 * x for x in [1,2,3] if x > 1 else 0]
except SyntaxError:
    print("This is never printed")

  File "C:\Users\Julius\AppData\Local\Temp/ipykernel_29776/3576225009.py", line 2
    [2 * x for x in [1,2,3] if x > 1 else 0]
                                     ^
SyntaxError: invalid syntax

If you simply wrap it in an eval statement, it still doesn’t work.

try:
    eval([2 * x for x in [1,2,3] if x > 1 else 0])
except SyntaxError:
    print("This is never printed")

  File "C:\Users\Julius\AppData\Local\Temp/ipykernel_29776/241822697.py", line 2
    eval([2 * x for x in [1,2,3] if x > 1 else 0])
                                          ^
SyntaxError: invalid syntax

But if it’s a string, it will catch the SyntaxError.

try:
    eval("[2 * x for x in [1,2,3] if x > 1 else 0]")
except SyntaxError:
    print("But this is printed")

But this is printed

And if there wasn’t an error, it would still evaluate it.

try:
    a = eval("[2 * x for x in [1,2,3] if x > 1]")
except SyntaxError:
    print("But this is printed")
print(a)

[4, 6]

Creating Your Own Exceptions

It’s a good idea to create your own exceptions. They don’t actually need to do anything a lot of the time (other than inherit from Exception). Their name alone is valuable.

class NiException(Exception):
    pass

try:
    if 'ni' in 'knights who say ni':
        raise NiException
except NiException:
    print("We raised and caught our custom exception")

We raised and caught our custom exception

Try, except statements

#Can also be more specific:

print("If you provide two integers, I will devide the first by the second")
try:
    a = int(input('Give me a number: '))
    b = int(input('Give me another: '))
    print(a/b)
except ValueError:
    print("That's not an int")
except ZeroDivisionError:
    print("Can't divide by zero")
except:
    print("I don't even know what you did wrong")

If you provide two integers, I will devide the first by the second
Give me a number: 2
Give me another: 0
Can't divide by zero

You can also use a finally statement to do something even after an error has been raised

try:
    a = int(input('Give me a number: '))
    b = int(input('Give me another: '))
    print(a/b)
except ValueError:
    print("That's not an int")
finally:
    print("Whether there's an exception or not, this runs. Good for closing a file.")

Give me a number: 1
Give me another: 1.5
That's not an int
Whether there's an exception or not, this runs. Good for closing a file.

You can also raise exceptions directly

try:
    1/0
except ZeroDivisionError:
    print("I've just picked up a fault in the AE35 unit. It's going to go 100% failure in 72 hours.")
    raise NiException

I've just picked up a fault in the AE35 unit. It's going to go 100% failure in 72 hours.

---------------------------------------------------------------------------

ZeroDivisionError                         Traceback (most recent call last)

<ipython-input-45-413944f2b329> in <module>
      1 try:
----> 2     1/0
      3 except ZeroDivisionError:

ZeroDivisionError: division by zero

During handling of the above exception, another exception occurred:

NiException                               Traceback (most recent call last)

<ipython-input-45-413944f2b329> in <module>
      3 except ZeroDivisionError:
      4     print("I've just picked up a fault in the AE35 unit. It's going to go 100% failure in 72 hours.")
----> 5     raise NiException

NiException:

Note that you can’t use this on some SyntaxErrors as the compiler has to parse everything to set up the try/except blocks, and a SyntaxError can prevent that from happening. Here’s an example:

try:
    [2 * x for x in [1,2,3] if x > 1 else 0]
except SyntaxError:
    print("This doesn't get printed")

  File "<ipython-input-46-f08e10e9850e>", line 2
    [2 * x for x in [1,2,3] if x > 1 else 0]
                                     ^
SyntaxError: invalid syntax

If you want to do this, you can wrap the statement in an eval statement, so the compiler has time to set up the try/except block.

try:
    eval("[2 * x for x in [1,2,3] if x > 1 else 0]")
except SyntaxError:
    print("This DOES get printed")

This DOES get printed

Assert

def div_by_two(x):
    assert (x%2 == 0), "Number must be even"#Assert that x is even; this makes the program stop immediately if it is not
    return x / 2
try:
    print(div_by_two(3))
except AssertionError as err:
    print('Error:', err)

Error: Number must be even

Aliasing

Python uses references. This can create aliasing problems

# Here's an example that isn't a problem
old_list = [1,2,3,4]
new_list = old_list
new_list[2]=7
print(old_list)

[1, 2, 7, 4]

old_list = [1,2,3,4]
new_list = old_list[:]
new_list[2]=7
print(old_list)

[1, 2, 3, 4]

old_list = [1,2,3,4]
new_list = list(old_list)
new_list[2]=7
print(old_list)

[1, 2, 3, 4]

To see this in more detail, you can look at the id of the variable

old_list = [1,2,3,4]
new_list = old_list
print(id(old_list))
print(id(new_list))
new_list = old_list[:]
# By using a copy, new_list gets a different id")
print(id(new_list))

3179535780032
3179535780032
3179534011968

Testing

For testing, I highly recommend pytest. One issue I had with it when I was getting started was that if it mocked the inputs I couldn’t run the test as a file (like to debug in VSCode). It turns out this is all you need.

import pytest
if __name__ == "__main__":
    pytest.main([__file__])

Or, if you just one to test a function or two, you can do

if __name__ == "__main__":
    pytest.main([test_my_func()])

Enums

from enum import Enum

Enums are a simple way of aliasing values.

class Animals(Enum):
    cat = 1
    dog = 2
    fish = 3

Animals.cat

<Animals.cat: 1>

for animal in Animals:
    print(animal)
    print(animal.value)

Animals.cat
1
Animals.dog
2
Animals.fish
3

IntEnums

IntEnums are like Enums except that you can also do integer comparison with them.

from enum import IntEnum

class Birds(IntEnum):
    cardinal = 1
    blue_jay = 2

Enums vs IntEnums

# can int compare IntEnums
print(Animals.dog == 2)
print(Birds.blue_jay == 2)

False
True

print(Birds.blue_jay < Birds.cardinal + 3)
try:
    print(Animals.dog < Animals.cat + 3)
except TypeError:
    print("Can't do interger comparison with standard Enums")

True
Can't do interger comparison with standard Enums

Scope

Scope is very important in Python. Different objects perform differently when they are modified in a function.

a = [1,2,3]

def cl(s):
    s[1] = 5

cl(a)

[1, 5, 3]

def cv(a):
    a = 5

q = 2

cv(q)

Type Annotations

I’m really into type annotations in Python, but I have to say, sometimes they make things ugly. Compare the following, with and without type annotations.

import pandas as pd
def func(inp):
    df = pd.read_csv(inp)
    return df

import pandas as pd
from pathlib import Path
from typing import AnyStr, IO, Union
def f(inp: Union[str, Path, IO[AnyStr]]) -> pd.DataFrame:
    df = pd.read_csv(inp)
    return df

You can see how pandas tried to solve the problem here, but it’s still messy.

Python Disassembler

The Python disassembler is a great tool if you want to see how a Python statement would be done in bytecode. For example, let’s say you’re wondering which of the two commands is better to use:

x is not None

not x is None

We can use dis to figure out what the difference is.

from dis import dis

def func1(x):
    return x is not None

def func2(x):
    return not x is None

dis(func1)

  2           0 LOAD_FAST                0 (x)
              2 LOAD_CONST               0 (None)
              4 COMPARE_OP               9 (is not)
              6 RETURN_VALUE

dis(func2)

  2           0 LOAD_FAST                0 (x)
              2 LOAD_CONST               0 (None)
              4 COMPARE_OP               9 (is not)
              6 RETURN_VALUE

It turns out there is no difference (although I think the first is easier to read and is recommended by PEP-8).

Inspection

class Rectangle:
    def __init__(self, length, width):
        self.length = length
        self.width = width
        
    def get_area(self):
        return self.length * self.width

r = Rectangle(3,5)

r.get_area()

import inspect

print(inspect.getsource(Rectangle.__init__))

    def __init__(self, length, width):
        self.length = length
        self.width = width

print(inspect.getsource(Rectangle.get_area))

    def get_area(self):
        return self.length * self.width

Namespace Mangling

class MyClass:
    
    def __dunder_example(self):
        print("Dunder Example")

if you want to access MyClass.__dunder_example, you’ll need to use self._MyClass__dunder_example. This is especially useful when debugging.

mc = MyClass()

try:
    mc.__dunder_example()
except AttributeError:
    print("Because of namespace mangling, you can't access this")

Because of namespace mangling, you can't access this

mc._MyClass__dunder_example()

Dunder Example

Julius' Data Science Blog

Python Cheat Sheet II

Learning your environment

Where am I?

What verison of Python am I using?

Where is my Python interpreter located?

What conda environment am I in?

Where will Python look for modules?

Python modules

Where are the site packages held?

Where is a particular package?

Aliasing

Functional programming

Zip

Filter

Map

Reduce

Error handling

IndexError

NameError

TypeError

SyntaxError

Creating Your Own Exceptions

Try, except statements

Assert

Aliasing

Testing

Enums

IntEnums

Enums vs IntEnums

Scope

Type Annotations

Python Disassembler

Inspection

Namespace Mangling