
How to get stable results with TensorFlow, setting random seed

1 Answer


In machine learning or deep learning model development using TensorFlow, ensuring the reproducibility of experimental results is crucial. Due to the randomness in weight initialization, dropout layers, and other components, the results of model training may vary each time. To achieve stable results, setting a random seed can mitigate the impact of this randomness.

Setting the Random Seed:

  1. Setting the Global Seed: TensorFlow provides the tf.random.set_seed() function to set the global random seed, which affects all layers and functions that use random operations.

```python
import tensorflow as tf

tf.random.set_seed(42)
```

The value 42 is the seed, which can be set to any integer. Using the same seed value ensures that the generated random numbers are identical across different runs.
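As a quick sanity check (assuming TensorFlow 2.x in eager mode), resetting the same global seed reproduces the same random tensor:

```python
import tensorflow as tf

# Seed the global generator and draw a random tensor
tf.random.set_seed(42)
a = tf.random.uniform((3,)).numpy()

# Reset the same seed and draw again
tf.random.set_seed(42)
b = tf.random.uniform((3,)).numpy()

# The two draws are element-wise identical
print((a == b).all())
```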

  2. Ensuring Identical Initializers for Each Layer: When defining model layers, explicitly specify the weight initializer and set its random seed. For example, when using the GlorotUniform initializer:

```python
initializer = tf.keras.initializers.GlorotUniform(seed=42)
model.add(tf.keras.layers.Dense(128, kernel_initializer=initializer))
```
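A seeded initializer is itself reproducible: two GlorotUniform instances constructed with the same seed generate identical weight matrices, which can be verified directly (assuming TensorFlow 2.x):

```python
import tensorflow as tf

# Two initializers created with the same seed
init_a = tf.keras.initializers.GlorotUniform(seed=42)
init_b = tf.keras.initializers.GlorotUniform(seed=42)

# Sampling the same shape yields identical weights
w_a = init_a(shape=(4, 4)).numpy()
w_b = init_b(shape=(4, 4)).numpy()

print((w_a == w_b).all())
```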
  3. Controlling Randomness in Other Libraries: If your TensorFlow project also uses other libraries (such as NumPy or Python's built-in random module), set their random seeds as well:

```python
import numpy as np
import random

np.random.seed(42)
random.seed(42)
```
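The same reseed-and-replay check works for NumPy and the standard-library random module: seeding, drawing, then reseeding with the same value replays the identical sequence.

```python
import numpy as np
import random

# Seed both generators and draw
np.random.seed(42)
random.seed(42)
a = np.random.random(5)
x = random.random()

# Reseed with the same values and draw again
np.random.seed(42)
random.seed(42)
b = np.random.random(5)
y = random.random()

print(np.array_equal(a, b), x == y)  # → True True
```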

Example: Building a Simple Model

The following example demonstrates how to set the random seed when building a simple neural network:

```python
import tensorflow as tf
import numpy as np
import random

# Set global seeds
tf.random.set_seed(42)
np.random.seed(42)
random.seed(42)

# Build the model
model = tf.keras.Sequential([
    tf.keras.layers.Dense(128, activation='relu',
                          kernel_initializer=tf.keras.initializers.GlorotUniform(seed=42)),
    tf.keras.layers.Dropout(0.2),
    tf.keras.layers.Dense(10, activation='softmax')
])

# Compile the model
model.compile(optimizer='adam',
              loss='sparse_categorical_crossentropy',
              metrics=['accuracy'])

# Simulate data
x_train = np.random.random((1000, 32))
y_train = np.random.randint(10, size=(1000,))

# Train the model
model.fit(x_train, y_train, epochs=10)
```

With these settings in place, repeated runs of the code produce consistent results even though training involves random operations, because all major sources of randomness are controlled. In summary, setting random seeds ensures the reproducibility of model training and experiments, which is critical for scientific research and for model validation in production environments.
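One caveat worth noting: fixing seeds alone does not guarantee bit-exact results on GPUs, where some kernels are inherently nondeterministic. If exact run-to-run reproducibility matters, TensorFlow 2.8+ also offers an op-determinism switch, sketched below (it can slow training noticeably):

```python
import tensorflow as tf

# Force deterministic implementations of ops where available (TF 2.8+);
# combine with seeding for full reproducibility
tf.config.experimental.enable_op_determinism()
tf.random.set_seed(42)
```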

August 10, 2024, 14:09
