Revised chunking implemented for Issue #67 for improved memory management #106

behreth · 2018-12-11T23:07:01Z

This closes issue #67 memory error on 32bit Python

Main change:

Created chunking logic to call the classifier with a maximum number of tests (detailed description as code comment).

In addition the following change was made:

Replaced the try/catch with an explicit check for the available function either decision_function or predict_proba.

Main change: - Created chunking logic to call the classifier with a maximum number of tests (detailed description as code comment). In addition the following change was made: - Replaced the try/catch with an explicit check for the available function either decision_function or predict_proba.

behreth · 2018-12-11T23:09:16Z

For details of history see PR #105

amueller · 2018-12-12T19:01:57Z

mglearn/plot_2d_separator.py

+    y_chunk_pos = 0
+    for x_chunk in np.array_split(X, np.arange(chunk_size,X_axis0_size,chunk_size,dtype=np.int32), axis=0):
+        Y_result_chunks.append(classifier_pred_or_decide(x_chunk))
+        y_chunk_pos += x_chunk.shape[0]


this variable is not used, is it?

Correct; and I inlined the
X_axis0_size
removing this as well.

amueller · 2018-12-12T19:03:56Z

mglearn/plot_2d_separator.py

+    # MLPClassifier(solver='lbfgs', random_state=0, hidden_layer_sizes=[1000,1000,1000])
+    # by reducing the value it is possible to trade in time for memory.
+    # It is possible to chunk the array as the calculations are independent of each other.
+    # Note: an intermittent version made a distinction between 32- and 64 bit architectures


Not sure if the note is necessary but also not opposed.

I think it is capturing a rationale from the past, so if you do not mind, I leave it.

amueller · 2018-12-12T19:04:31Z

mglearn/plot_2d_separator.py

+
+    # Call the classifier in chunks.
+    y_chunk_pos = 0
+    for x_chunk in np.array_split(X, np.arange(chunk_size,X_axis0_size,chunk_size,dtype=np.int32), axis=0):


can you please adhere to pep8? So spaces after , and not more than 79 chars per line.
But otherwise looks good, thank you!

Sure thanks for the patience - I just should have "listened" to my IDE.

well thanks for being patient with my nit-picks ;)

Minor refinements due PR feedback - Removed unnecessary variables and inlined one-time used variable. - Re-introduced the originally intended solver solver='lbfgs' - Adhered to PEP8, breaking lines and comments accordingly

amueller · 2018-12-13T15:07:26Z

awesome, thanks for your help :)

behreth mentioned this pull request Dec 11, 2018

Chunking implemented for Issue #67 for improved memory management #105

Closed

amueller reviewed Dec 12, 2018

View reviewed changes

This closes issue amueller#67 memory error on 32bit Python

bb34a1f

Minor refinements due PR feedback - Removed unnecessary variables and inlined one-time used variable. - Re-introduced the originally intended solver solver='lbfgs' - Adhered to PEP8, breaking lines and comments accordingly

amueller merged commit 4255705 into amueller:master Dec 13, 2018

amueller mentioned this pull request Dec 13, 2018

Problem/bug with MLPClassifier using neural network (P. 110, third release) #67

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Revised chunking implemented for Issue #67 for improved memory management #106

Revised chunking implemented for Issue #67 for improved memory management #106

behreth commented Dec 11, 2018

behreth commented Dec 11, 2018

amueller Dec 12, 2018

behreth Dec 13, 2018

amueller Dec 12, 2018

behreth Dec 13, 2018

amueller Dec 12, 2018

behreth Dec 13, 2018

amueller Dec 13, 2018

amueller commented Dec 13, 2018

Revised chunking implemented for Issue #67 for improved memory management #106

Revised chunking implemented for Issue #67 for improved memory management #106

Conversation

behreth commented Dec 11, 2018

behreth commented Dec 11, 2018

amueller Dec 12, 2018

Choose a reason for hiding this comment

behreth Dec 13, 2018

Choose a reason for hiding this comment

amueller Dec 12, 2018

Choose a reason for hiding this comment

behreth Dec 13, 2018

Choose a reason for hiding this comment

amueller Dec 12, 2018

Choose a reason for hiding this comment

behreth Dec 13, 2018

Choose a reason for hiding this comment

amueller Dec 13, 2018

Choose a reason for hiding this comment

amueller commented Dec 13, 2018