Commit 126c5d2

committed: keras-ptx_errors

1 parent 6511a2c commit 126c5d2

File tree

3 files changed: +401 −3 lines changed

Diff for: readme/todo_list_readme.md

+83 −1

@@ -1,5 +1,87 @@
- [ADVERSARIAL_AutoEncoder]

- [Semi_Supervised_Learning_SGAN]

#

- [pytorch_torchvision_various_models](https://github.com/pytorch/vision/tree/main/references/classification#alexnet-and-vgg)

> Trying various pre-trained TorchVision models for MNIST and CIFAR basic transfer-learning classification tasks -- a minimal sketch follows below
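> A minimal transfer-learning sketch (an assumed setup: ResNet-18 backbone, CIFAR-10 via `torchvision.datasets` -- not necessarily the exact models tried in the link above):

```python
import torch
import torch.nn as nn
from torchvision import datasets, models, transforms

# Assumed example: ResNet-18 pretrained on ImageNet, re-headed for CIFAR-10 (10 classes)
model = models.resnet18(weights=models.ResNet18_Weights.DEFAULT)
for p in model.parameters():
    p.requires_grad = False                      # freeze the backbone
model.fc = nn.Linear(model.fc.in_features, 10)   # new trainable classification head

transform = transforms.Compose([
    transforms.Resize(224),                      # CIFAR images are 32x32; ResNet expects larger inputs
    transforms.ToTensor(),
    transforms.Normalize((0.485, 0.456, 0.406), (0.229, 0.224, 0.225)),
])
train_set = datasets.CIFAR10(root="./data", train=True, download=True, transform=transform)
loader = torch.utils.data.DataLoader(train_set, batch_size=64, shuffle=True)

optimizer = torch.optim.Adam(model.fc.parameters(), lr=1e-3)
criterion = nn.CrossEntropyLoss()
model.train()
for images, labels in loader:                    # single batch as a smoke test
    optimizer.zero_grad()
    loss = criterion(model(images), labels)
    loss.backward()
    optimizer.step()
    break
```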
#
- [1-detectron2-DensePose-CSE-Continuous_Surface_Embeddings](https://github.com/facebookresearch/detectron2/blob/main/projects/DensePose/doc/DENSEPOSE_CSE.md#animal-cse-models)
- [1-detectron2-DensePose]
- [TODO--> VAE_variational_autoencoder]()
- [Number_Plate_license-plate-detection](https://paperswithcode.com/task/license-plate-detection)
#

> Ludwig -- Ludwig is a low-code framework for building custom AI models like LLMs and other deep neural networks.

- [Ludwig.ai](https://ludwig.ai/latest/)
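> A minimal Ludwig sketch (assumptions: config keys follow recent Ludwig versions; the toy DataFrame and its column names are hypothetical). The model is described declaratively by feature types rather than hand-written layers:

```python
import pandas as pd
from ludwig.api import LudwigModel

# Hypothetical toy dataset: one text field and one binary label
df = pd.DataFrame({
    "review": ["great product", "terrible service", "loved it", "never again"],
    "positive": [True, False, True, False],
})

# Declarative config: Ludwig assembles the network from the declared feature types
config = {
    "input_features": [{"name": "review", "type": "text"}],
    "output_features": [{"name": "positive", "type": "binary"}],
    "trainer": {"epochs": 2},
}

model = LudwigModel(config)
train_stats, _, output_dir = model.train(dataset=df)
predictions, _ = model.predict(dataset=df)
print(predictions.head())
```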
#
- [Vehicle_ID_Through_Traffic_VehicleRear](https://github.com/icarofua/vehicle-rear)

> AUTHORS -- Ícaro Oliveira de Oliveira, Rayson Laroca, David Menotti, Keiko Veronica Ono Fonseca, Rodrigo Minetto

Vehicle-Rear: A New Dataset to Explore Feature Fusion For Vehicle Identification Using Convolutional Neural Networks

> A two-stream Convolutional Neural Network (CNN) that simultaneously uses two of the most distinctive and persistent features available: the vehicle’s appearance and its license plate.
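> A minimal two-stream feature-fusion sketch in Keras (assumed input shapes and layer sizes, not the authors' published architecture), just to illustrate the appearance + license-plate fusion idea:

```python
import tensorflow as tf
from tensorflow.keras import layers, models

def small_cnn(name, shape):
    """One convolutional stream producing a feature vector."""
    inp = layers.Input(shape=shape, name=f"{name}_input")
    x = layers.Conv2D(32, 3, activation="relu", padding="same")(inp)
    x = layers.MaxPooling2D()(x)
    x = layers.Conv2D(64, 3, activation="relu", padding="same")(x)
    x = layers.GlobalAveragePooling2D()(x)
    return inp, x

# Two streams: vehicle-appearance crop and license-plate crop (shapes are assumptions)
app_in, app_feat = small_cnn("appearance", (128, 128, 3))
plate_in, plate_feat = small_cnn("plate", (64, 128, 3))

# Feature fusion by concatenation, then a binary "same vehicle / different vehicle" head
fused = layers.Concatenate()([app_feat, plate_feat])
out = layers.Dense(1, activation="sigmoid")(layers.Dense(128, activation="relu")(fused))

model = models.Model(inputs=[app_in, plate_in], outputs=out)
model.compile(optimizer="adam", loss="binary_crossentropy", metrics=["accuracy"])
model.summary()
```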
#
- [AppleMobile_TuriCreate_SupportVectorMachine_Classifier](https://apple.github.io/turicreate/docs/api/generated/turicreate.svm_classifier.create.html#turicreate.svm_classifier.create)

> Apple Mobile -- SVM with TuriCreate

Create an SVMClassifier to predict the class of a binary target variable based on a model of which side of a hyperplane the example falls on. In addition to standard numeric and categorical types, features can also be extracted automatically from list- or dictionary-type SFrame columns.

Zhang et al. - Modified Logistic Regression: An Approximation to SVM and its Applications in Large-Scale Text Categorization (ICML 2003)

```python
>>> import turicreate
>>> data = turicreate.SFrame('https://static.turi.com/datasets/regression/houses.csv')
>>> data['is_expensive'] = data['price'] > 30000
>>> model = turicreate.svm_classifier.create(data, 'is_expensive')
```
#
- [ONNX_Runtime](https://onnxruntime.ai/)
- https://onnxruntime.ai/index.html#getStartedTable
- Train in Python but deploy into a C#/C++/Java app -- [Deploy_ONNX](https://onnxruntime.ai/docs/)

> Get a model. This can be trained from any framework that supports export/conversion to ONNX format. See the tutorials for some of the popular frameworks/libraries.

- https://onnxruntime.ai/docs/api/python/tutorial.html
- [Android_App_ONNX_Runtime](https://onnxruntime.ai/docs/tutorials/on-device-training/android-app.html)
```python
import onnxruntime as ort

# Load the model and create an InferenceSession
model_path = "path/to/your/onnx/model"
session = ort.InferenceSession(model_path)

# Load and preprocess the input image into a numpy array `inputTensor`
# (its dtype/shape and the input name "input" must match the model's graph)
...

# Run inference
outputs = session.run(None, {"input": inputTensor})
print(outputs)
```

#
```python
from sklearn.datasets import load_iris
from sklearn.linear_model import LogisticRegression
from skl2onnx import convert_sklearn
from skl2onnx.common.data_types import FloatTensorType

# Train a simple scikit-learn classifier on the 4-feature iris dataset
X, y = load_iris(return_X_y=True)
clr = LogisticRegression(max_iter=1000).fit(X, y)

# Declare the input signature, convert the fitted model, and write the ONNX file
initial_type = [('float_input', FloatTensorType([None, 4]))]
onx = convert_sklearn(clr, initial_types=initial_type)
with open("logreg_iris.onnx", "wb") as f:
    f.write(onx.SerializeToString())
```
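
> A short usage check (assuming the `logreg_iris.onnx` file written above) that runs the exported model with ONNX Runtime:

```python
import numpy as np
import onnxruntime as ort
from sklearn.datasets import load_iris

# Load the exported model and query its input name instead of hard-coding it
sess = ort.InferenceSession("logreg_iris.onnx")
input_name = sess.get_inputs()[0].name

X, _ = load_iris(return_X_y=True)
sample = X[:5].astype(np.float32)            # FloatTensorType expects float32

# First output is the predicted label, second the per-class probabilities
pred_label, pred_proba = sess.run(None, {input_name: sample})
print(pred_label, pred_proba)
```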
#
- [Android_Studio_TensorFlowLite](https://www.tensorflow.org/lite/android/quickstart) -- Keras-to-TFLite conversion sketch below
- https://stackoverflow.com/questions/49193985/fastest-way-to-run-recurrent-neural-network-inference-on-mobile-device
- [TensorRt](https://github.com/NVIDIA/TensorRT)
-
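> A minimal Keras-to-TFLite conversion sketch (the tiny model here is a hypothetical stand-in for a real trained network); the resulting `.tflite` file is what the Android quickstart app bundles:

```python
import tensorflow as tf

# Hypothetical tiny Keras model standing in for a real trained network
model = tf.keras.Sequential([
    tf.keras.layers.Flatten(input_shape=(28, 28, 1)),
    tf.keras.layers.Dense(10, activation="softmax"),
])

# Convert to a TensorFlow Lite flatbuffer for on-device inference
converter = tf.lite.TFLiteConverter.from_keras_model(model)
converter.optimizations = [tf.lite.Optimize.DEFAULT]   # optional post-training optimization
tflite_model = converter.convert()

with open("model.tflite", "wb") as f:
    f.write(tflite_model)
```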
Diff for: src/term_log_autoencoder_1.log

+139

@@ -0,0 +1,139 @@

/src/term_log_autoencoder_1.log

2023-09-12 22:07:19.989922: I tensorflow/core/platform/cpu_feature_guard.cc:182] This TensorFlow binary is optimized to use available CPU instructions in performance-critical operations.
To enable the following instructions: AVX2 FMA, in other operations, rebuild TensorFlow with the appropriate compiler flags.
2023-09-12 22:07:20.474719: W tensorflow/compiler/tf2tensorrt/utils/py_utils.cc:38] TF-TRT Warning: Could not find TensorRT
2023-09-12 22:07:21.297792: I tensorflow/compiler/xla/stream_executor/cuda/cuda_gpu_executor.cc:995] successful NUMA node read from SysFS had negative value (-1), but there must be at least one NUMA node, so returning NUMA node zero. See more at https://github.com/torvalds/linux/blob/v6.0/Documentation/ABI/testing/sysfs-bus-pci#L344-L355
2023-09-12 22:07:21.316412: I tensorflow/compiler/xla/stream_executor/cuda/cuda_gpu_executor.cc:995] successful NUMA node read from SysFS had negative value (-1), but there must be at least one NUMA node, so returning NUMA node zero. See more at https://github.com/torvalds/linux/blob/v6.0/Documentation/ABI/testing/sysfs-bus-pci#L344-L355
2023-09-12 22:07:21.316686: I tensorflow/compiler/xla/stream_executor/cuda/cuda_gpu_executor.cc:995] successful NUMA node read from SysFS had negative value (-1), but there must be at least one NUMA node, so returning NUMA node zero. See more at https://github.com/torvalds/linux/blob/v6.0/Documentation/ABI/testing/sysfs-bus-pci#L344-L355
2023-09-12 22:07:21.317386: I tensorflow/compiler/xla/stream_executor/cuda/cuda_gpu_executor.cc:995] successful NUMA node read from SysFS had negative value (-1), but there must be at least one NUMA node, so returning NUMA node zero. See more at https://github.com/torvalds/linux/blob/v6.0/Documentation/ABI/testing/sysfs-bus-pci#L344-L355
2023-09-12 22:07:21.317615: I tensorflow/compiler/xla/stream_executor/cuda/cuda_gpu_executor.cc:995] successful NUMA node read from SysFS had negative value (-1), but there must be at least one NUMA node, so returning NUMA node zero. See more at https://github.com/torvalds/linux/blob/v6.0/Documentation/ABI/testing/sysfs-bus-pci#L344-L355
2023-09-12 22:07:21.317832: I tensorflow/compiler/xla/stream_executor/cuda/cuda_gpu_executor.cc:995] successful NUMA node read from SysFS had negative value (-1), but there must be at least one NUMA node, so returning NUMA node zero. See more at https://github.com/torvalds/linux/blob/v6.0/Documentation/ABI/testing/sysfs-bus-pci#L344-L355
2023-09-12 22:07:21.727511: I tensorflow/compiler/xla/stream_executor/cuda/cuda_gpu_executor.cc:995] successful NUMA node read from SysFS had negative value (-1), but there must be at least one NUMA node, so returning NUMA node zero. See more at https://github.com/torvalds/linux/blob/v6.0/Documentation/ABI/testing/sysfs-bus-pci#L344-L355
2023-09-12 22:07:21.727772: I tensorflow/compiler/xla/stream_executor/cuda/cuda_gpu_executor.cc:995] successful NUMA node read from SysFS had negative value (-1), but there must be at least one NUMA node, so returning NUMA node zero. See more at https://github.com/torvalds/linux/blob/v6.0/Documentation/ABI/testing/sysfs-bus-pci#L344-L355
2023-09-12 22:07:21.727977: I tensorflow/compiler/xla/stream_executor/cuda/cuda_gpu_executor.cc:995] successful NUMA node read from SysFS had negative value (-1), but there must be at least one NUMA node, so returning NUMA node zero. See more at https://github.com/torvalds/linux/blob/v6.0/Documentation/ABI/testing/sysfs-bus-pci#L344-L355
2023-09-12 22:07:21.728168: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1639] Created device /job:localhost/replica:0/task:0/device:GPU:0 with 2237 MB memory: -> device: 0, name: GeForce GTX 1650, pci bus id: 0000:01:00.0, compute capability: 7.5
1 Physical GPUs, 1 Logical GPUs
/home/dhankar/anaconda3/envs/env_tf2/lib/python3.9/site-packages/nvidia/cudnn/__init__.py
Unique TRAIN DATA Labels and IMAGE Counts: {0: 5923, 1: 6742, 2: 5958, 3: 6131, 4: 5842, 5: 5421, 6: 5918, 7: 6265, 8: 5851, 9: 5949}
Unique TEST DATA Labels and IMAGE Counts: {0: 980, 1: 1135, 2: 1032, 3: 1010, 4: 982, 5: 892, 6: 958, 7: 1028, 8: 974, 9: 1009}
Shape--> x_train , y_train--, x_test, y_test---> 60000 60000 10000 10000
--- 28
--- 28
---x_train.shape--- (60000, 28, 28)
---x_train.shape, x_train_noise.shape---> (60000, 28, 28, 1) (60000, 28, 28, 1)
Model: "Denoising_autoencoder"
27+
_________________________________________________________________
28+
Layer (type) Output Shape Param #
29+
=================================================================
30+
input_1 (InputLayer) [(None, 28, 28, 1)] 0
31+
32+
conv2d (Conv2D) (None, 28, 28, 32) 320
33+
34+
batch_normalization (Batch (None, 28, 28, 32) 128
35+
Normalization)
36+
37+
max_pooling2d (MaxPooling2 (None, 14, 14, 32) 0
38+
D)
39+
40+
conv2d_1 (Conv2D) (None, 14, 14, 32) 9248
41+
42+
batch_normalization_1 (Bat (None, 14, 14, 32) 128
43+
chNormalization)
44+
45+
max_pooling2d_1 (MaxPoolin (None, 7, 7, 32) 0
46+
g2D)
47+
48+
conv2d_2 (Conv2D) (None, 7, 7, 32) 9248
49+
50+
batch_normalization_2 (Bat (None, 7, 7, 32) 128
51+
chNormalization)
52+
53+
up_sampling2d (UpSampling2 (None, 14, 14, 32) 0
54+
D)
55+
56+
conv2d_3 (Conv2D) (None, 14, 14, 32) 9248
57+
58+
batch_normalization_3 (Bat (None, 14, 14, 32) 128
59+
chNormalization)
60+
61+
up_sampling2d_1 (UpSamplin (None, 28, 28, 32) 0
62+
g2D)
63+
64+
conv2d_4 (Conv2D) (None, 28, 28, 1) 289
65+
66+
=================================================================
67+
Total params: 28865 (112.75 KB)
68+
Trainable params: 28609 (111.75 KB)
69+
Non-trainable params: 256 (1.00 KB)
70+
_________________________________________________________________
71+
None
===== autoencoder ======
--layer.name- input_1
--layer.inbound_nodes- [<keras.src.engine.node.Node object at 0x7ff5dc3522e0>]
--layer.outbound_nodes- [<keras.src.engine.node.Node object at 0x7ff5dc2ccf40>]
===== autoencoder ======
--layer.name- conv2d
--layer.inbound_nodes- [<keras.src.engine.node.Node object at 0x7ff5dc2ccf40>]
--layer.outbound_nodes- [<keras.src.engine.node.Node object at 0x7ff5dc2424f0>]
===== autoencoder ======
--layer.name- batch_normalization
--layer.inbound_nodes- [<keras.src.engine.node.Node object at 0x7ff5dc2424f0>]
--layer.outbound_nodes- [<keras.src.engine.node.Node object at 0x7ff5dc224bb0>]
===== autoencoder ======
--layer.name- max_pooling2d
--layer.inbound_nodes- [<keras.src.engine.node.Node object at 0x7ff5dc224bb0>]
--layer.outbound_nodes- [<keras.src.engine.node.Node object at 0x7ff5dc224b50>]
===== autoencoder ======
--layer.name- conv2d_1
--layer.inbound_nodes- [<keras.src.engine.node.Node object at 0x7ff5dc224b50>]
--layer.outbound_nodes- [<keras.src.engine.node.Node object at 0x7ff5dc1e1580>]
===== autoencoder ======
--layer.name- batch_normalization_1
--layer.inbound_nodes- [<keras.src.engine.node.Node object at 0x7ff5dc1e1580>]
--layer.outbound_nodes- [<keras.src.engine.node.Node object at 0x7ff5dc1f31c0>]
===== autoencoder ======
--layer.name- max_pooling2d_1
--layer.inbound_nodes- [<keras.src.engine.node.Node object at 0x7ff5dc1f31c0>]
--layer.outbound_nodes- [<keras.src.engine.node.Node object at 0x7ff5dc1eea90>]
===== autoencoder ======
--layer.name- conv2d_2
--layer.inbound_nodes- [<keras.src.engine.node.Node object at 0x7ff5dc1eea90>]
--layer.outbound_nodes- [<keras.src.engine.node.Node object at 0x7ff5dc17d400>]
===== autoencoder ======
--layer.name- batch_normalization_2
--layer.inbound_nodes- [<keras.src.engine.node.Node object at 0x7ff5dc17d400>]
--layer.outbound_nodes- [<keras.src.engine.node.Node object at 0x7ff5dc17d640>]
===== autoencoder ======
--layer.name- up_sampling2d
--layer.inbound_nodes- [<keras.src.engine.node.Node object at 0x7ff5dc17d640>]
--layer.outbound_nodes- [<keras.src.engine.node.Node object at 0x7ff5dc1ee1f0>]
===== autoencoder ======
--layer.name- conv2d_3
--layer.inbound_nodes- [<keras.src.engine.node.Node object at 0x7ff5dc1ee1f0>]
--layer.outbound_nodes- [<keras.src.engine.node.Node object at 0x7ff5dc224490>]
===== autoencoder ======
--layer.name- batch_normalization_3
--layer.inbound_nodes- [<keras.src.engine.node.Node object at 0x7ff5dc224490>]
--layer.outbound_nodes- [<keras.src.engine.node.Node object at 0x7ff5dc188fa0>]
===== autoencoder ======
--layer.name- up_sampling2d_1
--layer.inbound_nodes- [<keras.src.engine.node.Node object at 0x7ff5dc188fa0>]
--layer.outbound_nodes- [<keras.src.engine.node.Node object at 0x7ff5dc188d30>]
===== autoencoder ======
--layer.name- conv2d_4
--layer.inbound_nodes- [<keras.src.engine.node.Node object at 0x7ff5dc188d30>]
--layer.outbound_nodes- []
Epoch 1/5
2023-09-12 22:07:25.789057: I tensorflow/compiler/xla/stream_executor/cuda/cuda_dnn.cc:432] Loaded cuDNN version 8600
2023-09-12 22:07:26.147469: E tensorflow/compiler/xla/stream_executor/gpu/asm_compiler.cc:114] *** WARNING *** You are using ptxas 11.0.194, which is older than 11.1. ptxas before 11.1 is known to miscompile XLA code, leading to incorrect results or invalid-address errors.

2023-09-12 22:07:26.445899: I tensorflow/compiler/xla/service/service.cc:168] XLA service 0x7ff404fc9670 initialized for platform CUDA (this does not guarantee that XLA will be used). Devices:
2023-09-12 22:07:26.445928: I tensorflow/compiler/xla/service/service.cc:176] StreamExecutor device (0): GeForce GTX 1650, Compute Capability 7.5
2023-09-12 22:07:26.449108: I tensorflow/compiler/mlir/tensorflow/utils/dump_mlir_util.cc:255] disabling MLIR crash reproducer, set env var `MLIR_CRASH_REPRODUCER_DIRECTORY` to enable.
2023-09-12 22:07:26.492713: E tensorflow/compiler/xla/stream_executor/gpu/asm_compiler.cc:114] *** WARNING *** You are using ptxas 11.0.194, which is older than 11.1. ptxas before 11.1 is known to miscompile XLA code, leading to incorrect results or invalid-address errors.

2023-09-12 22:07:26.524036: F tensorflow/compiler/xla/service/gpu/nvptx_compiler.cc:492] ptxas returned an error during compilation of ptx to sass: 'INTERNAL: ptxas exited with non-zero error code 65280, output: ptxas /tmp/tempfile-dhankar-1-74bb1998-9107-6052c0edf8252, line 5; fatal : Unsupported .version 7.1; current version is '7.0'
ptxas fatal : Ptx assembly aborted due to errors
' If the error message indicates that a file could not be written, please verify that sufficient filesystem space is provided.
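
For reference, a minimal Keras sketch reconstructed from the model summary in the log above. The filter counts and 3x3 kernels follow from the printed parameter counts; the activations, optimizer, and loss are assumptions, not the committed code:

```python
import tensorflow as tf
from tensorflow.keras import layers, models

def build_denoising_autoencoder():
    # Layer stack matching the printed summary (28865 total params)
    inp = layers.Input(shape=(28, 28, 1))
    x = layers.Conv2D(32, 3, padding="same", activation="relu")(inp)    # 320 params
    x = layers.BatchNormalization()(x)                                  # 128 params
    x = layers.MaxPooling2D()(x)
    x = layers.Conv2D(32, 3, padding="same", activation="relu")(x)      # 9248 params
    x = layers.BatchNormalization()(x)
    x = layers.MaxPooling2D()(x)
    x = layers.Conv2D(32, 3, padding="same", activation="relu")(x)      # 7x7x32 bottleneck
    x = layers.BatchNormalization()(x)
    x = layers.UpSampling2D()(x)
    x = layers.Conv2D(32, 3, padding="same", activation="relu")(x)
    x = layers.BatchNormalization()(x)
    x = layers.UpSampling2D()(x)
    out = layers.Conv2D(1, 3, padding="same", activation="sigmoid")(x)  # 289 params
    return models.Model(inp, out, name="Denoising_autoencoder")

autoencoder = build_denoising_autoencoder()
autoencoder.compile(optimizer="adam", loss="binary_crossentropy")
autoencoder.summary()   # should report Total params: 28865
```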
