C++ `thread_local` 存储：线程局部变量的高级应用

好的，各位观众，欢迎来到今天的“C++ thread_local 存储：线程局部变量的高级应用”专场脱口秀。今天咱们不讲段子，讲点硬核的，但是保证各位听得懂，笑得出来（如果实在笑不出来，就当我在一本正经地胡说八道）。

开场白：全局变量的烦恼

话说，在多线程编程的世界里，全局变量就像个公共厕所，谁都能进去拉一泡（呸！谁都能访问）。虽然方便，但是很容易出现“拉肚子拉串了”的情况（数据竞争），导致程序崩溃，让你欲哭无泪。

#include <iostream>
#include <thread>

int global_counter = 0;

void increment_counter() {
  for (int i = 0; i < 100000; ++i) {
    global_counter++; // 多个线程同时修改，数据竞争！
  }
}

int main() {
  std::thread t1(increment_counter);
  std::thread t2(increment_counter);

  t1.join();
  t2.join();

  std::cout << "Global counter: " << global_counter << std::endl; // 结果通常不是 200000
  return 0;
}

这段代码，两个线程同时修改 global_counter，结果往往不是我们期望的 200000。这就是数据竞争的威力，让你怀疑人生。

为了解决这个问题，我们通常会祭出锁（Mutex）这个神器，给全局变量上把锁，确保同一时间只有一个线程可以访问。

#include <iostream>
#include <thread>
#include <mutex>

int global_counter = 0;
std::mutex counter_mutex;

void increment_counter() {
  for (int i = 0; i < 100000; ++i) {
    std::lock_guard<std::mutex> lock(counter_mutex); // 加锁，保证线程安全
    global_counter++;
  }
}

int main() {
  std::thread t1(increment_counter);
  std::thread t2(increment_counter);

  t1.join();
  t2.join();

  std::cout << "Global counter: " << global_counter << std::endl; // 结果是 200000，万岁！
  return 0;
}

加锁之后，世界清净了，数据安全了。但是，锁也不是万能的，它会带来性能开销，降低程序的并发度。想象一下，高峰期的公共厕所，大家都排队等着，效率能高吗？

救星登场：thread_local 变量

这时候，thread_local 变量就像一位救星一样，闪亮登场。它告诉编译器：“哥们儿，这个变量是线程私有的，每个线程都有自己的副本，互不干扰！”

#include <iostream>
#include <thread>

thread_local int thread_counter = 0; // 每个线程都有自己的 thread_counter

void increment_counter() {
  for (int i = 0; i < 100000; ++i) {
    thread_counter++; // 每个线程操作自己的副本，没有数据竞争
  }
  std::cout << "Thread counter: " << thread_counter << std::endl;
}

int main() {
  std::thread t1(increment_counter);
  std::thread t2(increment_counter);

  t1.join();
  t2.join();

  // main 函数中的 thread_counter 未被初始化，值为 0
  std::cout << "Main thread counter: " << thread_counter << std::endl;
  return 0;
}

在这个例子中，每个线程都有自己的 thread_counter 变量，它们互不影响，所以不需要加锁，也能保证数据安全。是不是感觉世界又美好了？

thread_local 的特性

为了让大家更深入地了解 thread_local，我们来总结一下它的特性：

特性	说明
线程私有性	每个线程都拥有变量的独立副本，互不干扰。
生命周期	变量的生命周期与线程的生命周期相同。线程创建时分配内存，线程结束时释放内存。
初始化	每个线程第一次访问 `thread_local` 变量时，都会执行初始化操作。如果没有显式初始化，则会进行默认初始化（例如，int 类型初始化为 0，指针类型初始化为 nullptr）。
作用域	作用域与其他变量相同，取决于变量的声明位置。
存储位置	存储在线程本地存储 (Thread Local Storage, TLS) 中，通常由操作系统管理。
使用场景	需要线程隔离的数据，例如：错误码、随机数生成器、数据库连接等。避免使用锁，提高并发性能。

thread_local 的高级应用

好了，说了这么多基础知识，现在我们来聊聊 thread_local 的高级应用。

线程安全的单例模式

单例模式是一种常用的设计模式，用于确保一个类只有一个实例。在多线程环境下，传统的单例模式需要加锁才能保证线程安全。但是，使用 thread_local 可以实现线程安全的单例模式，避免锁的开销。

#include <iostream>
#include <thread>

class Singleton {
private:
  Singleton() { std::cout << "Singleton created." << std::endl; }
  ~Singleton() { std::cout << "Singleton destroyed." << std::endl; }

  // 禁止拷贝构造和赋值
  Singleton(const Singleton&) = delete;
  Singleton& operator=(const Singleton&) = delete;

public:
  static Singleton& getInstance() {
    thread_local static Singleton instance; // 线程安全的单例
    return instance;
  }

  void doSomething() { std::cout << "Singleton doing something in thread " << std::this_thread::get_id() << std::endl; }
};

void thread_function() {
  Singleton& instance = Singleton::getInstance();
  instance.doSomething();
}

int main() {
  std::thread t1(thread_function);
  std::thread t2(thread_function);

  t1.join();
  t2.join();

  return 0;
}

在这个例子中，thread_local static Singleton instance 保证了每个线程都有自己的 Singleton 实例，避免了多线程竞争。

为每个线程提供独立的随机数生成器

在多线程程序中，如果所有线程都使用同一个随机数生成器，可能会导致随机数序列出现重复，影响程序的正确性。使用 thread_local 可以为每个线程提供独立的随机数生成器，保证随机数序列的独立性。

#include <iostream>
#include <thread>
#include <random>

thread_local std::mt19937 generator(std::hash<std::thread::id>()(std::this_thread::get_id())); // 每个线程都有自己的随机数生成器

int generate_random_number(int min, int max) {
  std::uniform_int_distribution<int> distribution(min, max);
  return distribution(generator);
}

void thread_function() {
  for (int i = 0; i < 5; ++i) {
    std::cout << "Thread " << std::this_thread::get_id() << ": " << generate_random_number(1, 100) << std::endl;
  }
}

int main() {
  std::thread t1(thread_function);
  std::thread t2(thread_function);

  t1.join();
  t2.join();

  return 0;
}

这段代码为每个线程创建了一个独立的 std::mt19937 随机数生成器，并使用线程 ID 作为种子，保证了随机数序列的唯一性。

简化线程上下文管理

在某些情况下，我们需要在不同的函数之间传递线程上下文信息，例如：用户 ID、事务 ID 等。使用 thread_local 可以将这些信息存储在线程本地变量中，避免显式传递，简化代码。

#include <iostream>
#include <thread>

thread_local int user_id = 0; // 存储当前线程的用户 ID

void set_user_id(int id) {
  user_id = id;
}

int get_user_id() {
  return user_id;
}

void process_request() {
  std::cout << "Processing request for user " << get_user_id() << " in thread " << std::this_thread::get_id() << std::endl;
}

void thread_function(int id) {
  set_user_id(id);
  process_request();
}

int main() {
  std::thread t1(thread_function, 123);
  std::thread t2(thread_function, 456);

  t1.join();
  t2.join();

  return 0;
}

在这个例子中，user_id 存储了当前线程的用户 ID，process_request 函数可以直接访问 user_id，无需显式传递。

错误码管理

在多线程环境下，错误码的处理也需要特别注意。如果使用全局变量存储错误码，可能会出现多个线程同时修改错误码的情况，导致错误信息混乱。使用 thread_local 可以为每个线程提供独立的错误码存储空间，避免线程之间的干扰。

#include <iostream>
#include <thread>

thread_local int error_code = 0;

void set_error_code(int code) {
  error_code = code;
}

int get_error_code() {
  return error_code;
}

void perform_operation() {
  // 模拟操作失败
  set_error_code(1);
  std::cout << "Operation failed in thread " << std::this_thread::get_id() << std::endl;
}

void thread_function() {
  perform_operation();
  if (get_error_code() != 0) {
    std::cout << "Thread " << std::this_thread::get_id() << " encountered an error: " << get_error_code() << std::endl;
  }
}

int main() {
  std::thread t1(thread_function);
  std::thread t2(thread_function);

  t1.join();
  t2.join();

  return 0;
}

避免递归函数中的静态变量竞争

递归函数中的静态变量在多线程环境下也可能出现竞争。thread_local 可以为每个线程提供静态变量的独立副本，避免竞争。

#include <iostream>
#include <thread>

void recursive_function(int n) {
  thread_local static int depth = 0; // 每个线程都有自己的 depth

  std::cout << "Thread " << std::this_thread::get_id() << ", Depth: " << depth << ", n: " << n << std::endl;
  depth++;

  if (n > 0) {
    recursive_function(n - 1);
  }

  depth--;
}

void thread_function() {
  recursive_function(5);
}

int main() {
  std::thread t1(thread_function);
  std::thread t2(thread_function);

  t1.join();
  t2.join();

  return 0;
}

注意事项

虽然 thread_local 很好用，但是也有一些需要注意的地方：

性能开销： 访问 thread_local 变量通常比访问普通变量要慢，因为它需要访问线程本地存储。
初始化顺序： thread_local 变量的初始化顺序是不确定的，可能会导致一些问题。
销毁顺序： thread_local 变量的销毁顺序也是不确定的，可能会导致一些问题。
平台依赖性： thread_local 的实现可能因平台而异。

总结

thread_local 变量是 C++ 中一个非常有用的特性，它可以帮助我们编写线程安全、高性能的多线程程序。但是，在使用 thread_local 时，需要注意一些细节，避免出现问题。

结尾

好了，今天的“C++ thread_local 存储：线程局部变量的高级应用”专场脱口秀就到这里了。希望大家通过今天的学习，能够更好地理解和使用 thread_local 变量。记住，编程就像生活，需要不断学习，不断探索，才能找到属于自己的乐趣。谢谢大家！

发表回复 取消回复

发表回复取消回复