如何优雅地利用c++编程从1乘到20？

当然！在 C++ 中优雅地实现从 1 乘到 20，我们可以有多种方法，每种方法都有其独特的“优雅”之处。这里我将为你详细解释几种常见且优雅的实现方式，并分析它们的优缺点。

核心目标：计算 1 2 3 ... 20 的值。

“优雅”的定义：在编程中，“优雅”通常意味着代码具有以下特点：

清晰易懂 (Readability): 代码容易被其他开发者（甚至是你自己）理解。
简洁高效 (Conciseness & Efficiency): 用最少的代码完成任务，并且在性能上没有不必要的损耗。
可维护性 (Maintainability): 代码容易修改和扩展。
健壮性 (Robustness): 能处理潜在的问题（尽管在这个简单的例子中问题较少）。
符合编程范式 (Idiomatic): 遵循 C++ 的语言习惯和最佳实践。

方法一：使用 `for` 循环 (最经典、最直观)

这是最常见也是最容易理解的方法。我们使用一个循环来迭代从 1 到 20 的每个数字，并将它们累乘到结果变量中。

```cpp
include
include // 为了使用 int64_t 以防溢出

int main() {
// 使用 int64_t 来存储结果，因为从1乘到20的积会比较大，
// int 类型可能会发生溢出。
// 阶乘增长非常快，20! 已经超出了 32 位整数的范围。
int64_t product = 1;

// 循环从 1 遍历到 20
for (int i = 1; i <= 20; ++i) {
product = i; // 将当前数字 i 乘到 product 中
}

std::cout << "从 1 乘到 20 的结果是: " << product << std::endl;

return 0;
}
```

详细解释：

1. `include `: 包含输入输出流库，用于 `std::cout` 输出结果。
2. `include `: 这是一个非常重要的头文件，它提供了固定宽度的整数类型，如 `int64_t`。`int64_t` 保证了它是一个 64 位有符号整数。
为什么需要 `int64_t`？我们计算的是 20 的阶乘 (20!)。阶乘的增长速度非常快。
10! = 3,628,800
15! = 1,307,674,368,000
20! = 2,432,902,008,176,640,000
一个标准的 32 位有符号整数（`int` 或 `long`，取决于平台）最大值通常是 `2,147,483,647`。很显然，20! 远远超过了这个范围，会导致溢出，产生错误的结果。
64 位有符号整数 (`int64_t`) 的最大值大约是 `9 x 10^18`，足以容纳 20!。
3. `int64_t product = 1;`:
我们声明一个变量 `product` 来存储累乘的结果。
初始化为 `1` 是至关重要的。乘法运算的单位元是 1。如果我们初始化为 0，那么任何数乘以 0 都等于 0，结果永远是 0。
4. `for (int i = 1; i <= 20; ++i)`:
这是一个标准的 `for` 循环。
`int i = 1`: 初始化循环计数器 `i` 为 1。
`i <= 20`: 循环的继续条件。只要 `i` 小于或等于 20，循环就会继续执行。
`++i`: 每次循环结束后，将 `i` 的值增加 1。这是前缀自增，在 C++ 中通常比后缀自增 (`i++`) 更推荐用于循环计数器，因为它避免了创建临时变量的开销（尽管现代编译器通常会优化掉这个开销，但前缀自增是更“本真”的写法）。
5. `product = i;`: 这是累乘的核心操作。它等价于 `product = product i;`。在每次循环中，我们将当前的 `product` 值与当前的 `i` 值相乘，并将结果重新赋给 `product`。
6. `std::cout << ... << std::endl;`: 将最终计算出的 `product` 值输出到控制台，并换行。
7. `return 0;`: 表示程序成功执行。

优雅之处：

清晰直观：代码逻辑直接反映了“从 1 乘到 20”的数学定义。
易于理解：任何接触过编程的人都能轻松看懂。
高效：循环执行 20 次，操作简单，性能极佳。
健壮：通过使用 `int64_t`，有效避免了溢出问题。

方法二：使用 `std::accumulate` 和 lambda 表达式 (更函数式、更简洁)

C++ 标准库提供了强大的算法，可以让我们用更简洁的方式表达意图。`std::accumulate` 就可以用来实现这种累加（或累乘）操作。

```cpp
include
include // 用于创建数据序列
include // 包含 std::accumulate
include // 为了使用 int64_t

int main() {
// 1. 创建一个从 1 到 20 的数字序列
std::vector numbers;
for (int i = 1; i <= 20; ++i) {
numbers.push_back(i);
}

// 2. 使用 std::accumulate 进行累乘
// std::accumulate(begin_iterator, end_iterator, initial_value, binary_operation)
// initial_value 的类型决定了累加结果的类型，所以这里用 int64_t(1)
int64_t product = std::accumulate(
numbers.begin(), // 起始迭代器
numbers.end(), // 结束迭代器
int64_t(1), // 初始值 (注意类型转换)
[](int64_t acc, int val) { // lambda 表达式定义累乘操作
return acc val;
}
);

std::cout << "从 1 乘到 20 的结果是: " << product << std::endl;

return 0;
}
```

更简洁的写法，直接生成序列（C++20 起）：

```cpp
include
include // 包含 std::accumulate
include // 包含 std::views::iota
include // 为了使用 int64_t

int main() {
// 使用 ranges::iota 创建一个从 1 到 20 的序列 (C++20)
auto numbers_view = std::views::iota(1, 21); // [1, 21)

// 使用 std::accumulate 进行累乘
int64_t product = std::accumulate(
numbers_view.begin(),
numbers_view.end(),
int64_t(1),
[](int64_t acc, int val) {
return acc val;
}
);

std::cout << "从 1 乘到 20 的结果是: " << product << std::endl;

return 0;
}
```

详细解释：

1. `include ` / `include `:
`std::vector` 是一个动态数组，用于存储 1 到 20 的数字。
`std::ranges` 和 `std::views::iota`（C++20 新特性）提供了一种更声明式的方式来生成一个整数序列。`std::views::iota(1, 21)` 创建了一个“视图”，表示从 1 开始，直到但不包括 21 的整数序列。
2. `std::accumulate`:
这是一个非常通用的算法，用于对一个范围内的元素进行累加（或任何其他二元操作）。
它有四个参数：
`numbers.begin()`: 序列的起始迭代器。
`numbers.end()`: 序列的结束迭代器（指向最后一个元素的下一个位置）。
`int64_t(1)`: 初始值。这是最关键的地方。`std::accumulate` 的返回值类型将由这个初始值的类型决定。我们将其设为 `int64_t(1)`，确保累乘结果使用 64 位整数，并以 1 开始。
`[](int64_t acc, int val) { return acc val; }`: 这是一个 lambda 表达式。它定义了在每次迭代中执行的二元操作。
`int64_t acc`: 累积器（Accumulator），代表到目前为止的中间结果。它的类型与初始值相同。
`int val`: 当前正在处理的序列中的元素。
`return acc val;`: 将累积器与当前值相乘，并将结果返回作为新的累积器值。
3. C++20 `std::views::iota`:
这种方式无需手动创建一个 `std::vector` 并填充它，而是直接生成一个序列视图，更加简洁和内存高效（因为不需要实际存储所有数字）。

优雅之处：

简洁声明式：你声明“我想要累乘一个范围的数字”，而不是详细说明如何循环和更新变量。
函数式风格：借鉴了函数式编程的思想，将操作封装在 lambda 表达式中。
可复用性： `std::accumulate` 是一个通用的算法，可以用于各种求和、求积、连接字符串等操作，代码更具通用性。
C++20 版本更优：使用 `ranges` 进一步提升了简洁性和表达力。

需要注意：

`std::accumulate` 的初始值类型非常重要，决定了最终结果的类型和溢出行为。

方法三：使用递归 (概念上的优雅，但效率不高)

虽然不是最实用的方法（对于阶乘来说，递归通常比迭代效率低，且有栈溢出的风险），但从概念上讲，递归是一种非常优雅的数学定义方式。

```cpp
include
include

// 递归函数计算阶乘
// 注意：对于较大的 n，可能会导致栈溢出，并且效率低于迭代
int64_t factorial(int n) {
// 基本情况 (Base Case): 0! = 1
if (n == 0) {
return 1;
}
// 递归步骤 (Recursive Step): n! = n (n1)!
else {
return static_cast(n) factorial(n 1);
}
}

int main() {
// 计算 20 的阶乘
int number = 20;
int64_t result = factorial(number);

std::cout << "从 1 乘到 " << number << " 的结果是: " << result << std::endl;

return 0;
}
```

详细解释：

1. `int64_t factorial(int n)`: 定义一个函数 `factorial`，它接收一个整数 `n`，并返回 `n` 的阶乘。返回值类型同样是 `int64_t` 以防止溢出。
2. 基本情况 `if (n == 0)`:
递归必须有一个停止条件，称为基本情况。0 的阶乘被定义为 1。当 `n` 减到 0 时，递归停止，返回 1。
3. 递归步骤 `else { return static_cast(n) factorial(n 1); }`:
这是递归的核心。如果 `n` 不为 0，则函数返回 `n` 乘以 `(n1)` 的阶乘。
`static_cast(n)`：确保在乘法发生之前，`n` 被转换为 `int64_t` 类型，以匹配 `factorial(n1)` 的返回类型，并参与到 64 位运算中，防止中间乘积溢出。
`factorial(n 1)`：这是对函数自身的调用，但参数是 `n1`。这个调用会继续下去，直到达到基本情况。

优雅之处（概念上）：

数学定义映射：递归完美地映射了数学上阶乘的定义 (`n! = n (n1)!`)，概念清晰。
简洁性：函数的定义本身非常简洁。

缺点（不优雅之处）：

效率低下：每次函数调用都会有函数调用的开销（创建栈帧、传递参数、返回地址等）。对于像阶乘这样可以用简单循环解决的问题，递归的效率远不如迭代。
栈溢出风险：如果计算非常大的数字的阶乘，递归深度会很大，可能导致栈溢出错误。
可读性相对较差：对于不熟悉递归的人来说，理解递归的执行流程可能比迭代更困难一些。

总结与选择

对于“从 1 乘到 20”这个具体任务，哪种方法最“优雅”？

方法一 (`for` 循环)：最推荐。它在清晰度、效率和简洁性之间取得了最佳平衡。对于大多数 C++ 开发者来说，这是最自然、最容易理解和维护的方式。
方法二 (`std::accumulate`)：非常优雅，尤其推荐给熟悉 C++ STL 的开发者。它展示了利用标准库算法来声明式地解决问题的能力。如果使用 C++20 的 `ranges`，则更加简洁和现代。如果你的代码库已经广泛使用了 STL 算法，那么这种风格会更一致。
方法三 (递归)：概念上优雅，实践中不推荐。它更多的是一种展示对递归理解的方式，但在这个特定场景下，它不是最实用或最高效的选择。

最终选择取决于你的具体上下文、团队的熟悉程度以及你对“优雅”的偏好。但对于初学者或者追求稳健实用的代码，`for` 循环无疑是最佳选择。如果你想展示 C++ 的现代特性和函数式编程风格，`std::accumulate` 是一个很好的选择。

希望这些详细的解释能帮助你理解如何在 C++ 中优雅地解决这个问题！

网友意见

数学家版本：

       #include <iostream> #include <cmath> int main() {     std::cout << std::tgamma(20 + 1) << std::endl; }

语言学家版本：

       #include <iostream> #include <utility>  template<std::size_t...I> constexpr auto foo(std::index_sequence<I...>) { return ((I+1) * ...); }  int main()  {     std::cout << foo(std::make_index_sequence<20>()) << std::endl; }

“快速”版本：

       #include <iostream> #include <future>  long long foo(int a, int b, std::future<long long> last = std::async(std::integral_constant<long long, 1>())) {     return a == b ? a * last.get() : foo((a + b) / 2 + 1, b, std::async(foo, a, (a + b) / 2, std::move(last))); }  int main() {     std::cout << foo(1, 20) << std::endl; }

历史学家版本：

       #include <stdio.h> void main(void) {     int i;     long long j;     for(i = 1, j = 1;i <= 20; j *= i++);         printf("%lld", j); }

敏捷开发上线1.0版本：

       #include <stdio.h> int main() {     //printf("%d", 1*2*3*4*5*6*7*8*9*10);     printf("%lld", (long long)1*2*3*4*5*6*7*8*9*10*11*12*13*14*15*16*17*18*19*20); }

面向对象专家版本：

       #include <iostream> #include <string> #include <memory>  struct IBaseInterface {     virtual ~IBaseInterface() = 0; }; inline IBaseInterface::~IBaseInterface() = default;  struct IDataProvider : virtual public IBaseInterface {     virtual int first() = 0;     virtual int last() = 0;     virtual int next(int v) = 0; };  struct ICalculator : virtual public IBaseInterface {     virtual long long calc(IDataProvider *) = 0; };  struct IPrinter : virtual public IBaseInterface {     virtual void print(const std::string &) = 0; };  struct ISerializer : virtual public IBaseInterface {     virtual std::string serialize(long long value) = 0; };  struct IRunnable : virtual public IBaseInterface {     virtual void run() = 0; };  class Foo : virtual public IRunnable {     std::shared_ptr<IDataProvider> m_dp;     std::shared_ptr<ICalculator> m_c;     std::shared_ptr<ISerializer> m_s;     std::shared_ptr<IPrinter> m_p; public:     Foo(std::shared_ptr<IDataProvider> dp, std::shared_ptr<ICalculator> c, std::shared_ptr<ISerializer> s, std::shared_ptr<IPrinter> p) : m_dp(std::move(dp)), m_c(std::move(c)), m_s(std::move(s)),m_p(std::move(p)) {}     void run() override { return  m_p->print(m_s->serialize(m_c->calc(m_dp.get()))); } };  class DefaultSerializer : virtual public ISerializer { public:     std::string serialize(long long value) override { return std::to_string(value); } };  class StreamPrinter : virtual public IPrinter {     std::ostream &m_os; public:     explicit StreamPrinter (std::ostream &os) : m_os(os) {}     void print(const std::string &s)  override { m_os << s << std::endl; } };  class MultiplyAccumulateCalculator : virtual public ICalculator { public:     long long calc(IDataProvider *dp) override {         int i = dp->first();         long long j = i;         do             j *= (i = dp->next(i));         while(i != dp->last());         return j;     } };  int main() {     struct MyDataProvider : virtual public IDataProvider {         int first() override { return 1; }         int last() override { return 20; }         int next(int v) override { return v+1; }     };     Foo foo(std::make_shared<MyDataProvider>(), std::make_shared<MultiplyAccumulateCalculator>(), std::make_shared<DefaultSerializer>(), std::make_shared<StreamPrinter>(std::cout));     foo.run(); }

提前优化的并行版本：

       #include <iostream> #include <xmmintrin.h>  double foo(int x) {     __m128 a = {1.0f, 2.0f, 3.0f, 4.0f};     __m128 b = {4.0f, 4.0f, 4.0f, 4.0f};     __m128 c = {1.0f, 1.0f, 1.0f, 1.0f};     for(int i = 0; i < x / 4; ++i, a = _mm_add_ps(a, b))         c = _mm_mul_ps(c, a);     for(int i = x % 4; i < 4; ++i)         a[i] = 1.0f;     c = _mm_mul_ps(c, a);     return (double)c[0] * (double)c[1] * (double)c[2] * (double)c[3]; }  int main() {     std::cout << foo(20) << std::endl; }

“宏孩儿”元编程版：

       #include <boost/preprocessor.hpp>  // 由于boost.preprocessor仅提供255以下的整数运算 // 所以使用sequence来 (十位个位)(千位百位)(十万位万位) 的方式来表示大整数。  // 不进位加法：(77)(66)(55) + (44)(33)(22) = (121)(99)(77) #define PP_ADD_N_N_CARRY_OP(R, DATA, I, ELEM) (BOOST_PP_ADD(BOOST_PP_SEQ_ELEM(I, DATA), ELEM)) #define PP_ADD_N_N_CARRY(SEQ_A, SEQ_B) BOOST_PP_SEQ_FOR_EACH_I(PP_ADD_N_N_CARRY_OP, SEQ_A, SEQ_B)  // 进位加法：(121)(99)(77) = （21)(0)(78) // 注意SEQ_A的长度要比SEQ_B长 #define PP_ADD_N_N_OP(S, STATE, ELEM_CARRY)      BOOST_PP_SEQ_PUSH_FRONT(              BOOST_PP_SEQ_REPLACE(STATE, 0, BOOST_PP_MOD(BOOST_PP_ADD(BOOST_PP_SEQ_HEAD(STATE), ELEM_CARRY), 100)),              BOOST_PP_DIV(BOOST_PP_ADD(BOOST_PP_SEQ_HEAD(STATE), ELEM_CARRY), 100)              ) #define PP_ADD_N_N(SEQ_A, SEQ_B) BOOST_PP_SEQ_REVERSE(BOOST_PP_SEQ_FOLD_LEFT(PP_ADD_N_N_OP, BOOST_PP_SEQ_NIL(0), PP_ADD_N_N_CARRY(SEQ_A, SEQ_B)))  // 没什么好说的，X*N = X+X+X+X+X+...+X #define PP_MUL_N_1_EXP_OP(Z, I, DATA) (DATA) #define PP_MUL_N_1_EXP(SEQ_N, N) BOOST_PP_REPEAT(N, PP_MUL_N_1_EXP_OP, SEQ_N) #define PP_MUL_N_1_MYOP(S, STATE, ITEM) PP_ADD_N_N(STATE, ITEM) #define PP_MUL_N_1_FWD(EXP) BOOST_PP_SEQ_FOLD_LEFT(PP_MUL_N_1_MYOP, BOOST_PP_SEQ_HEAD(EXP), BOOST_PP_SEQ_TAIL(EXP)) #define PP_MUL_N_1(SEQ_N, N) PP_MUL_N_1_FWD(PP_MUL_N_1_EXP(SEQ_N, N))  #define FACT5 PP_MUL_N_1(PP_MUL_N_1(PP_MUL_N_1(PP_MUL_N_1((1), 2), 3), 4), 5) #define FACT10 PP_MUL_N_1(PP_MUL_N_1(PP_MUL_N_1(PP_MUL_N_1(PP_MUL_N_1(FACT5, 6), 7), 8), 9), 10) #define FACT15 PP_MUL_N_1(PP_MUL_N_1(PP_MUL_N_1(PP_MUL_N_1(PP_MUL_N_1(FACT10, 11), 12), 13), 14), 15) #define FACT20 PP_MUL_N_1(PP_MUL_N_1(PP_MUL_N_1(PP_MUL_N_1(PP_MUL_N_1(FACT15, 16), 17), 18), 19), 20) #define FACT25 PP_MUL_N_1(PP_MUL_N_1(PP_MUL_N_1(PP_MUL_N_1(PP_MUL_N_1(FACT20, 21), 22), 23), 24), 25)  static_assert(false, BOOST_PP_STRINGIZE(FACT10));

警告：目前只有Clang能算出FACT20，编译缓慢是十分正常的，请耐心等待。默认计算10的阶乘10!=3628800，期待输出：

       error: static_assert failed "(0) (88) (62) (3) (0) (0) (0) (0) (0) (0)"

真·模板元编程版本（大整数）

       #include <iostream> #include <iomanip> #include <type_traits>  using BaseType_t = long long; constexpr BaseType_t lgBase = 9; // 注意10000*10000刚刚好小于int的取值范围 constexpr BaseType_t Base = 1000000000; // 注意10000*10000刚刚好小于int的取值范围  // 大整数的表示 template<BaseType_t...I> struct BigInteger {     using type = BigInteger; };  // 连接 template<class T1, class T2> struct BI_Cat; template<BaseType_t...I1, BaseType_t...I2> struct BI_Cat <BigInteger<I1...>, BigInteger<I2...>> : BigInteger<I1..., I2...> {};  // 左移一个单元（即*Base） template<class T> struct BI_SHL; template<BaseType_t...I> struct BI_SHL<BigInteger<I...>> : BigInteger<I..., 0> {};  // 去除开头的0 template<class T> struct BI_Remove_Zeros : T {}; template<BaseType_t...I> struct BI_Remove_Zeros<BigInteger<0, I...>> : BI_Remove_Zeros<BigInteger<I...>> {};  // 填充0到N个单元 template<int X, class IS> struct BI_Fill_Impl; template<int X, class T, T...I> struct BI_Fill_Impl<X, std::integer_sequence<T, I...>> : BigInteger<(I, X)...> {}; template<int Size> struct BI_Fill_Zeros : BI_Fill_Impl<0, std::make_index_sequence<Size>> {};  template<class T, int N> struct BI_Resize; template<BaseType_t...I, int N> struct BI_Resize<BigInteger<I...>, N> : BI_Cat<typename BI_Fill_Zeros<N - sizeof...(I)>::type, BigInteger<I...>> {};  // 返回较大的数值 template<int A, int B> struct int_min : std::integral_constant<int, (A<B?B:A)> {};  // 非进位加法：先把两个数的位数改成一样的然后依次相加 template<class A, class B, class ShouldResize> struct BI_AddNotCarry_Impl; template<BaseType_t...I1, BaseType_t...I2> struct BI_AddNotCarry_Impl <BigInteger<I1...>, BigInteger<I2...>, std::true_type> : BigInteger<(I1 + I2)...> {};  template<BaseType_t...I1, BaseType_t...I2> struct BI_AddNotCarry_Impl <BigInteger<I1...>, BigInteger<I2...>, std::false_type>      : BI_AddNotCarry_Impl<          typename BI_Resize<BigInteger<I1...>, int_min<sizeof...(I1), sizeof...(I2)>::value>::type,          typename BI_Resize<BigInteger<I2...>, int_min<sizeof...(I1), sizeof...(I2)>::value>::type,          std::true_type      >{};  template<class A, class B> struct BI_AddNotCarry; template<BaseType_t...I1, BaseType_t...I2> struct BI_AddNotCarry <BigInteger<I1...>, BigInteger<I2...>>     : BI_AddNotCarry_Impl<BigInteger<I1...>, BigInteger<I2...>, std::bool_constant<sizeof...(I1) == sizeof...(I2)>> {};  // 判断是否为0 template<class Y> struct BI_IsZero; template<BaseType_t...I> struct BI_IsZero<BigInteger<I...>> : std::bool_constant<((I == 0) && ...)> {};  // 自动进位 template<class A> struct BI_Carry; template<class A, class B> struct BI_Add : BI_Carry<typename BI_AddNotCarry<A, B>::type> {};  template<class Mod, class Div, class ShouldCalc = typename BI_IsZero<Div>::type> struct BI_Carry_Impl; template<class Mod, class Div> struct BI_Carry_Impl<Mod, Div, std::true_type> : Mod {}; template<class Mod, class Div> struct BI_Carry_Impl<Mod, Div, std::false_type>         : BI_Add<Mod, typename BI_SHL<Div>::type > {}; template<BaseType_t...I> struct BI_Carry<BigInteger<I...>>         : BI_Remove_Zeros<typename BI_Carry_Impl<BigInteger<(I % Base)...>, BigInteger<(I / Base)...>>::type> {};  // 乘以X并自动进位 template<class A, int X> struct BI_MulX; template<BaseType_t...I1, int X> struct BI_MulX <BigInteger<I1...>, X>         : BI_Carry<BigInteger<(I1 * X)...>> {};  // 计算阶乘 template<int X> struct BI_Fact : BI_MulX<typename BI_Fact<X-1>::type, X> {}; template<> struct BI_Fact<0> : BigInteger<1> {};  template<BaseType_t...I> std::ostream &operator<<(std::ostream &out, BigInteger<I...>) {     return ((out << std::setfill('0') << I << std::setw(lgBase)), ...); }  int main() {     std::cout << typename BI_Fact<20>::type() << std::endl; }

如果将BI_Fact<20>改为BI_Fact<1000>后，我们可爱的Clang解释器花了3秒多的时间很偷税地算出来了1000! =

402387260077093773543702433923003985719374864210714632543799910429938512398629020592044208486969404800479988610197196058631666872994808558901323829669944590997424504087073759918823627727188732519779505950995276120874975462497043601418278094646496291056393887437886487337119181045825783647849977012476632889835955735432513185323958463075557409114262417474349347553428646576611667797396668820291207379143853719588249808126867838374559731746136085379534524221586593201928090878297308431392844403281231558611036976801357304216168747609675871348312025478589320767169132448426236131412508780208000261683151027341827977704784635868170164365024153691398281264810213092761244896359928705114964975419909342221566832572080821333186116811553615836546984046708975602900950537616475847728421889679646244945160765353408198901385442487984959953319101723355556602139450399736280750137837615307127761926849034352625200015888535147331611702103968175921510907788019393178114194545257223865541461062892187960223838971476088506276862967146674697562911234082439208160153780889893964518263243671616762179168909779911903754031274622289988005195444414282012187361745992642956581746628302955570299024324153181617210465832036786906117260158783520751516284225540265170483304226143974286933061690897968482590125458327168226458066526769958652682272807075781391858178889652208164348344825993266043367660176999612831860788386150279465955131156552036093988180612138558600301435694527224206344631797460594682573103790084024432438465657245014402821885252470935190620929023136493273497565513958720559654228749774011413346962715422845862377387538230483865688976461927383814900140767310446640259899490222221765904339901886018566526485061799702356193897017860040811889729918311021171229845901641921068884387121855646124960798722908519296819372388642614839657382291123125024186649353143970137428531926649875337218940694281434118520158014123344828015051399694290153483077644569099073152433278288269864602789864321139083506217095002597389863554277196742822248757586765752344220207573630569498825087968928162753848863396909959826280956121450994871701244516461260379029309120889086942028510640182154399457156805941872748998094254742173582401063677404595741785160829230135358081840096996372524230560855903700624271243416909004153690105933983835777939410970027753472000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000

无人值守自动迭代版本

       #include <iostream> #include <numeric> #include <vector> #include <functional> int main() {  std::vector<int> v(std::atoi(std::end(__DATE__) - (__LINE__) / 2)); // 2020年，第六行  std::iota(v.begin(), v.end(), 1);  std::cout << std::accumulate(v.begin(), v.end(), 1ull, std::multiplies<>()) << std::endl; }

模板元编程版本（二进制）

       #include <iostream> #include <bitset> #include <type_traits> using Zero = std::integer_sequence<bool>; using One = std::integer_sequence<bool, true>; template<class T> struct type_identity : std::enable_if<true, T> {}; // *2 template<class T> struct shl; template<bool...B> struct shl<std::integer_sequence<bool, B...>> : type_identity<std::integer_sequence<bool, false, B...>> {}; // *2+1 template<class T> struct shl_inc; template<bool...B> struct shl_inc<std::integer_sequence<bool, B...>> : type_identity<std::integer_sequence<bool, true, B...>> {}; // /2 template<class T> struct shr; template<bool First, bool...Rest> struct shr<std::integer_sequence<bool, First, Rest...>> : type_identity<std::integer_sequence<bool, Rest...>> {}; // +1 template<class T> struct inc; template<> struct inc<Zero> : type_identity<One> {}; template<bool...B> struct inc<std::integer_sequence<bool, true, B...>> : shl<typename inc<std::integer_sequence<bool, B...>>::type> {}; template<bool...B> struct inc<std::integer_sequence<bool, false, B...>> : type_identity<std::integer_sequence<bool, true, B...>> {}; // -1 template<class T> struct dec; template<bool...B> struct dec<std::integer_sequence<bool, true, B...>> : type_identity<std::integer_sequence<bool, false, B...>> {}; template<bool...B> struct dec<std::integer_sequence<bool, false, B...>> : shl_inc<typename dec<std::integer_sequence<bool, B...>>::type> {}; // 取最低位 template<class T> struct lowest : std::false_type {}; template<bool First, bool...Rest> struct lowest<std::integer_sequence<bool, First, Rest...>> : std::bool_constant<First> {}; // 扩展位数 template<class T> struct ext; template<bool...B> struct ext<std::integer_sequence<bool, B...>> : type_identity<std::integer_sequence<bool, B..., false>> {}; // 比较两个数是否相等 template<class A, class B> struct test; template<bool...B1, bool...B2> struct test<std::integer_sequence<bool, B1...>, std::integer_sequence<bool, B2...>> : std::bool_constant<(... || (B1 || B2)) > {}; // 计算位数大小 template<class T> struct len; template<> struct len<Zero> : std::integral_constant<std::size_t, 0> {}; template<bool...B> struct len<std::integer_sequence<bool, B...>> : std::integral_constant<std::size_t, sizeof...(B)> {}; // 去除多余的高位 template<class T, bool not_empty = test<T, T>::value> struct shrink : type_identity<Zero> {}; template<bool...B> struct shrink<std::integer_sequence<bool, false, B...>, true> : shl<typename shrink<std::integer_sequence<bool, B...>>::type> {}; template<bool...B> struct shrink<std::integer_sequence<bool, true, B...>, true> : shl_inc<typename shrink<std::integer_sequence<bool, B...>>::type> {}; // 实现超前进位加法器 template<class A, class B, bool loop = test<B, B>::value> struct add_impl : type_identity<A> {}; template<bool...B1, bool...B2> struct add_impl<std::integer_sequence<bool, B1...>, std::integer_sequence<bool, B2...>, true>  : add_impl<std::integer_sequence<bool, (B1^ B2)..., false>, std::integer_sequence<bool, false, (B1& B2)...>> {}; // 加法时处理位数不同的情况 template<class A, class B, bool = (len<A>::value > len<B>::value)> struct add_fill2 : add_impl<A, B> {}; template<class A, class B> struct add_fill2<A, B, true> : add_fill2<A, typename ext<B>::type>{}; template<class A, class B, bool = (len<A>::value < len<B>::value)> struct add : add_fill2<A, B> {}; template<class A, class B> struct add<A, B, true> : add<typename ext<A>::type, B> {}; // 实现乘法器 template<class A, class B, bool add = lowest<B>::value> struct mul; template<class A> struct mul<A, Zero, false> : type_identity<Zero> {}; template<class A, class B> struct mul<A, B, false> : mul<typename shl<A>::type, typename shr<B>::type> {}; template<class A, class B> struct mul<A, B, true> : add<typename mul<typename shl<A>::type, typename shr<B>::type>::type, A> {}; // 计算阶乘 template<class T> struct greater_than_one : test<typename dec<T>::type, typename dec<T>::type> {}; template<class T> struct fact : shrink<typename mul<T, typename fact<typename shrink<typename dec<T>::type>::type>::type>::type> {}; template<> struct fact<One> : type_identity<One> {}; // 转换为bitset输出 template<bool...B, std::size_t...I> auto ToBitSet(std::integer_sequence<bool, B...> b, std::integer_sequence<std::size_t, I...>) {  std::bitset<sizeof...(B)> ret;  (..., ret.set(I, B));  return ret; } template<bool...B> auto ToBitSet(std::integer_sequence<bool, B...> b) {  return ToBitSet(b, std::make_index_sequence<sizeof...(B)>()); }  int main() {  using F20 = fact<std::integer_sequence<bool, false, false, true, false, true>>::type;  std::cout << ToBitSet(F20()).to_ullong() << std::endl;  //using F31 = fact<std::integer_sequence<bool, true, true, true, true, true>>::type;  //std::cout << ToBitSet(F31()).to_string() << std::endl; }

注：目前极限是计算到31的阶乘，使用MSVC编译31的阶乘需30G内存。

更新：

2020-1-12 将“历史学家”版本修改为void main(void)更具有历史气息

2020-1-16 将“快速”二分版本修改为「伪」递归调用

2020-1-18 并行版本

2020-2-22 宏元编程版本

2020-2-24 模板元编程版本（大整数）

2020-3-14 无人值守自动迭代版本

2021-10-12 模板元编程版本（二进制）

无所谓优雅。

       #include <stdio.h> int main() {          printf("%Lf 
", (long double)1.0*2*3*4*5*6*7*8*9*10*11*12*13*14*15*16*17*18*19*20);     return 0; }

       2432902008176640000.000000  [wenxue@dad5600 _CPP_]$

在GNU C编译器中，long double 在x86处理器上是80位的扩展精度，而不考虑用于该类型的物理存储（可以是96位或128位），在其他一些架构上，long double 可以是 double

double（例如PowerPC）或128位四倍精度（例如SPARC[19]）。从gcc 4.3开始，四倍精度(quadruple precision )在x86上也被支持，但作为非标准类型 __float128 而不是long double。

虽然x86架构，特别是x86上的x87浮点指令，支持80位扩展精度的操作，但是可以配置处理器自动将操作四舍五入到 double 双精度（甚至是单精度）。相反，在扩展精度模式下，即使最终结果以较低的精度存储（即 FLT_EVAL_METHOD == 2），扩展精度也可用于中间编译器生成的计算。在Linux上的gcc，80位扩展精度是默认的；在几个BSD操作系统(FreeBSD和OpenBSD)上，double 双精度模式是默认的，long double 操作实际上被降低到双精度 (然而，NetBSD 7.0及以后的版本，默认为80位扩展精度)。

在单个程序中可以通过FLDCW "浮点加载控制字 "指令来覆盖这一点。在x86_64上，BSD默认为80位扩展精度。微软的 Windows 和 Visual C++ 也默认将处理器设置为 double 双精度模式，但是这也可以在单个程序中被覆盖（例如，通过Visual C++中的 _controlfp_s 函数）。另一方面，Intel C++ Compiler for x86默认启用扩展精度模式。在IA-32 OS X上，long double 数是80位扩展精度。

例二

       #include<iostream> // #include<math.h> using namespace std;  size_t factorial(size_t n);  int main() {     cout << "x! of " << 20 << " = " << factorial( (size_t)20 )  <<"
";     return 0; }  size_t factorial(size_t n) {     if(n > 1.0)         return n * factorial(n - 1);     else         return (size_t)1; }   x! of 20 = 2432902008176640000 [wenxue@dad5600 _CPP_]$

例三

       #include <iostream> using namespace std;  int main() {     size_t n;     size_t fact = 1;      cout << "Please enter a positive integer: 
";     cin >> n;      if (n < 0)         cout << "Crap! don't give me negative. 
";     else {         for(size_t i = 1; i <= n; ++i) {             fact *= i;         }         cout << "Factorial of " << n << " = " << fact <<" 
";         }      return 0; }   Please enter a positive integer:  20 Factorial of 20 = 2432902008176640000  [wenxue@dad5600 _CPP_]$

参考

       https://en.wikipedia.org/wiki/Long_double

我来提供两个CUDA并行版本的。

方法1：

       #include <iostream>  typedef unsigned long long int ull; const int N = 20;  __device__ ull atomicMul(ull* address, ull val) {     ull *address_as_ull = (ull *)address;     ull old = *address_as_ull, assumed;     do {         assumed = old;         old = atomicCAS(address_as_ull, assumed, val * assumed);     } while (assumed != old);     return old; }  __global__ void mul(ull *x) {     ull i = threadIdx.x + 1;     atomicMul(x, i); }  int main() {     ull *x;     cudaMallocManaged(&x, sizeof(ull));     x[0] = 1;     mul<<<1, N>>>(x);     cudaDeviceSynchronize();     std::cout << x[0] << std::endl;     cudaFree(x);     return 0; }

利用原子乘操作，由于CUDA里没有原子乘这个操作（有原子加atomicAdd和原子减atomicSub），所以需要利用atomicCAS函数来手动实现一个原子乘atomicMul。

方法2：

       #include <iostream>  typedef unsigned long long int ull; const int N = 20; const int WARP_SIZE = 32;  __global__ void mul(ull *x) {     int i = threadIdx.x;     ull val = x[i];     for (int mask = WARP_SIZE / 2; mask > 0; mask >>= 1)         val *= __shfl_xor_sync(WARP_SIZE - 1, val, mask, WARP_SIZE);     x[i] = val; }  int main() {     ull *x;     cudaMallocManaged(&x, WARP_SIZE * sizeof(ull));     for (int i = 0; i < WARP_SIZE; ++i)         x[i] = i < N ? i + 1 : 1;     mul<<<1, WARP_SIZE>>>(x);     cudaDeviceSynchronize();     std::cout << x[0] << std::endl;     cudaFree(x);     return 0; }

利用warpReduce操作，在log时间内对warp内连续32个元素进行乘积。

感谢 @NekoDaemon 提供的方法2进一步优化版本，无需开辟数组存储元素，在计算时直接算出对应元素即可：

       #include <iostream>  typedef unsigned long long int ull; const int N = 20; const int WARP_SIZE = 32;  __global__ void mul(ull *x) {     int i = threadIdx.x;     ull val = i < N ? i + 1 : 1;     for (int mask = WARP_SIZE / 2; mask > 0; mask >>= 1)         val *= __shfl_xor_sync(WARP_SIZE - 1, val, mask, WARP_SIZE);     if (!i) x[i] = val; }  int main() {     ull *x;     cudaMallocManaged(&x, sizeof(ull));     mul<<<1, WARP_SIZE>>>(x);     cudaDeviceSynchronize();     std::cout << x[0] << std::endl;     cudaFree(x);     return 0; }

运行：

       nvcc run.cu -o run ./run

输出结果：

       2432902008176640000

如何优雅地利用c++编程从1乘到20？

网友意见

方法1：

方法2：

类似的话题

如何优雅地利用c++编程从1乘到20？

如何优雅地利用旧 iPhone？

放弃本省211来到上海普通一本，都说要去大地方，在校园生活中如何最大化地利用地域优势来提升自己？

如何优雅地吃方便面？

如何优雅地炫富？

如何优雅地撩汉？

如何优雅地测量一只猫的体积，而不使其感到惊恐或受到伤害？

如何优雅地使用 Windows？

如何优雅地布置 iPhone 桌面？

如何优雅地像《黑暗之魂》中一样翻滚？

如何优雅地将唐僧骗出金箍棒划的圈？

如何优雅地表达「我化学没学好」？

如何优雅地接「夜阑卧听风吹雨，铁马是你，冰河也是你」？

如何优雅地在知乎上找到优质女友？

如何优雅地表白?

如何优雅地看美女？

如何优雅地吐槽自己的母校？

如何优雅地反抗语言暴力？

如何优雅地哭穷？

如何优雅地剥出一颗好看的栗子？

如何优雅地利用c++编程从1乘到20？

网友意见

方法1：

方法2：

类似的话题

如何优雅地利用c++编程从1乘到20？

如何优雅地利用旧 iPhone？

放弃本省211来到上海普通一本，都说要去大地方，在校园生活中如何最大化地利用地域优势来提升自己？

如何优雅地吃方便面？

如何优雅地炫富？

如何优雅地撩汉？

如何优雅地测量一只猫的体积，而不使其感到惊恐或受到伤害？

如何优雅地使用 Windows？

如何优雅地布置 iPhone 桌面？

如何优雅地像《黑暗之魂》中一样翻滚？

如何优雅地将唐僧骗出金箍棒划的圈？

如何优雅地表达「我化学没学好」？

如何优雅地接 「夜阑卧听风吹雨，铁马是你，冰河也是你 」？

如何优雅地在知乎上找到优质女友？

如何优雅地表白?

如何优雅地看美女？

如何优雅地吐槽自己的母校？

如何优雅地反抗语言暴力？

如何优雅地哭穷？

如何优雅地剥出一颗好看的栗子？

如何优雅地接「夜阑卧听风吹雨，铁马是你，冰河也是你」？