Of course! Let's dive deep into broadcasting, a powerful feature of the NumPy library.

What is Broadcasting?
In simple terms, broadcasting is a set of rules that allows NumPy to perform operations on arrays of different shapes.
Imagine you have a small array and a large array. You want to perform an operation (like addition) on them element-wise. Normally, this would fail because their shapes don't match. Broadcasting "stretches" or "repeats" the smaller array across the larger one so they have compatible shapes, allowing the operation to proceed without making unnecessary copies of the data.
This makes broadcasting efficient in both memory and computation.
The Core Idea: Shape Compatibility
Broadcasting works by comparing the shapes of two arrays, element by element, starting from the trailing (rightmost) dimension and moving left.

Two dimensions are compatible if:
- They are equal, or
- One of them is 1.
If all dimensions are compatible, the arrays can be broadcast. If they are not, NumPy will raise a ValueError.
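If you want to check compatibility without running a full operation, np.broadcast_shapes (available since NumPy 1.20) applies exactly these rules to shape tuples, returning the broadcast shape or raising the same ValueError:

```python
import numpy as np

# Compatible: trailing dims 4 vs 4 (equal), then 3 vs 1 (one is 1)
print(np.broadcast_shapes((3, 4), (1, 4)))  # -> (3, 4)

# Incompatible: trailing dims 3 vs 2 -- not equal and neither is 1
try:
    np.broadcast_shapes((1, 3), (4, 2))
except ValueError as e:
    print(f"ValueError: {e}")
```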
How Broadcasting Works: The Step-by-Step Rules
Let's break down the exact process NumPy follows. Consider two arrays, A and B.
Rule 1: Add leading "1"s to the shape of the smaller array.
If the arrays have a different number of dimensions, prepend the shape of the smaller array with 1s until they have the same number of dimensions.
Rule 2: Compare dimensions.
Compare the dimensions of the two arrays element by element (from right to left). If they are not equal, check if one of them is 1. If so, the array with dimension 1 is "stretched" to match the other dimension.
Rule 3: Create the output shape. The shape of the resulting array is the maximum of the two shapes for each dimension.
Rule 4: Perform the operation. The operation is performed as if the smaller array had been "repeated" (or tiled) to match the final shape, but without actually creating a new, large array in memory.
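You can watch Rule 4 in action with np.broadcast_to, which produces the "stretched" array as a read-only view. A row stride of 0 means every repeated row reads the same underlying data; nothing is copied:

```python
import numpy as np

a = np.array([1, 2, 3], dtype=np.int64)  # shape (3,)
stretched = np.broadcast_to(a, (4, 3))   # "repeated" to (4, 3), but only a view

print(stretched.shape)    # (4, 3)
print(stretched.strides)  # (0, 8): a row stride of 0 means all 4 rows
                          # point at the same three int64 values
```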
Visual Examples
Let's see this in action with a few common scenarios.
Example 1: Scalar and a 2D Array (The Simplest Case)
This is the most intuitive example. A scalar acts like a 0-dimensional array with shape ().
import numpy as np
# A scalar (shape ())
scalar = 5
# A 2D array (shape (3, 4))
arr = np.array([[1, 2, 3, 4],
                [5, 6, 7, 8],
                [9, 10, 11, 12]])
# Rule 1: Shapes are () and (3, 4). Prepend 1s to the scalar's shape: () -> (1, 1)
# Rule 2: Compare (1, 1) with (3, 4).
# - Dim 2: 1 vs 4 -> Compatible. Stretch 1 to 4.
# - Dim 1: 1 vs 3 -> Compatible. Stretch 1 to 3.
# Rule 3: Output shape is (3, 4).
result = scalar + arr
print(result)
Output:
[[ 6  7  8  9]
 [10 11 12 13]
 [14 15 16 17]]
Visualization:
The scalar 5 is conceptually "stretched" into a (3, 4) array filled with 5s, and then added to arr element-wise.
5  -->  [[5 5 5 5]
         [5 5 5 5]
         [5 5 5 5]]

[[ 1  2  3  4]     [[5 5 5 5]     [[ 6  7  8  9]
 [ 5  6  7  8]  +   [5 5 5 5]  =   [10 11 12 13]
 [ 9 10 11 12]]     [5 5 5 5]]     [14 15 16 17]]
Example 2: 1D Array and a 2D Array
import numpy as np
# A 1D array (shape (4,))
row_vector = np.array([1, 0, 3, 0])
# A 2D array (shape (3, 4))
matrix = np.array([[1, 2, 3, 4],
                   [5, 6, 7, 8],
                   [9, 10, 11, 12]])
# Rule 1: Shapes are (4,) and (3, 4). Add leading 1 to row_vector: (4,) -> (1, 4)
# Rule 2: Compare (1, 4) with (3, 4).
# - Dim 2: 4 vs 4 -> Equal. Compatible.
# - Dim 1: 1 vs 3 -> Compatible. Stretch 1 to 3.
# Rule 3: Output shape is (3, 4).
result = matrix + row_vector
print(result)
Output:
[[ 2  2  6  4]
 [ 6  6 10  8]
 [10 10 14 12]]
Visualization:
The (1, 4) row_vector is "stretched" or "tiled" down to match the (3, 4) shape of the matrix.
[[1 0 3 0]]  -->  [[1 0 3 0]
                   [1 0 3 0]
                   [1 0 3 0]]

[[ 1  2  3  4]     [[1 0 3 0]     [[ 2  2  6  4]
 [ 5  6  7  8]  +   [1 0 3 0]  =   [ 6  6 10  8]
 [ 9 10 11 12]]     [1 0 3 0]]     [10 10 14 12]]
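To convince yourself the "tiling" picture is accurate, you can compare the broadcast result against an explicit np.tile copy. Both compute the same values, but tile actually materializes the full (3, 4) array:

```python
import numpy as np

matrix = np.arange(1, 13).reshape(3, 4)  # same matrix as above
row_vector = np.array([1, 0, 3, 0])

broadcast_result = matrix + row_vector               # no copy of row_vector
tiled_result = matrix + np.tile(row_vector, (3, 1))  # explicit (3, 4) copy

print(np.array_equal(broadcast_result, tiled_result))  # True
```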
Example 3: 2D Array and a Column Vector (shape (4, 1))
import numpy as np
# A 2D array (shape (4, 1))
col_vector = np.array([[1],
                       [2],
                       [3],
                       [4]])
# A 2D array (shape (4, 3))
matrix = np.array([[1, 2, 3],
                   [4, 5, 6],
                   [7, 8, 9],
                   [10, 11, 12]])
# Rule 1: Shapes are (4, 1) and (4, 3). No leading 1s needed.
# Rule 2: Compare (4, 1) with (4, 3).
# - Dim 2: 1 vs 3 -> Compatible. Stretch 1 to 3.
# - Dim 1: 4 vs 4 -> Equal. Compatible.
# Rule 3: Output shape is (4, 3).
result = matrix + col_vector
print(result)
Output:
[[ 2  3  4]
 [ 6  7  8]
 [10 11 12]
 [14 15 16]]
Visualization:
The (4, 1) col_vector is "stretched" or "tiled" across to match the (4, 3) shape of the matrix.
[[1]        [[1 1 1]
 [2]   -->   [2 2 2]
 [3]         [3 3 3]
 [4]]        [4 4 4]]

[[ 1  2  3]     [[1 1 1]     [[ 2  3  4]
 [ 4  5  6]  +   [2 2 2]  =   [ 6  7  8]
 [ 7  8  9]      [3 3 3]      [10 11 12]
 [10 11 12]]     [4 4 4]]     [14 15 16]]
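In practice you rarely write a column vector out by hand. A 1D array can be turned into one by inserting an axis with np.newaxis (reshape(-1, 1) is equivalent), after which it broadcasts down the columns:

```python
import numpy as np

v = np.array([1, 2, 3, 4])  # shape (4,)
col = v[:, np.newaxis]      # shape (4, 1)

matrix = np.arange(1, 13).reshape(4, 3)  # same matrix as above
result = matrix + col

print(col.shape)     # (4, 1)
print(result.shape)  # (4, 3)
```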
A Case That Fails: Incompatible Shapes
What if the dimensions don't match?
import numpy as np
# A 1D array (shape (3,))
x = np.array([1, 2, 3])
# A 2D array (shape (4, 2))
y = np.array([[1, 2],
              [3, 4],
              [5, 6],
              [7, 8]])
# Rule 1: Shapes are (3,) and (4, 2). Add leading 1 to x: (3,) -> (1, 3)
# Rule 2: Compare (1, 3) with (4, 2).
# - Dim 2: 3 vs 2 -> Not equal and neither is 1. **INCOMPATIBLE!**
try:
    result = x + y
except ValueError as e:
    print(f"ValueError: {e}")
Output:
ValueError: operands could not be broadcast together with shapes (3,) (4,2)
Why is Broadcasting Important?
- Memory Efficiency: It avoids creating large intermediate arrays. In the row_vector + matrix example, NumPy doesn't actually create a full (3, 4) copy of the row vector. It performs the calculation using the original small array and a clever indexing scheme, saving significant RAM.
- Performance: By avoiding data duplication, broadcasting leads to faster computations because it operates on the original, contiguous data blocks in memory.
- Code Readability and Conciseness: It allows you to write clean, vectorized code instead of slow, verbose Python loops. You express the operation at a high level, and NumPy handles the details efficiently.
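The memory claim is easy to verify: a broadcast view reports the large shape, but its row stride of 0 shows that no new memory was allocated; the view still reads the original 24 bytes:

```python
import numpy as np

row = np.zeros(3)                           # 3 float64 values, 24 bytes
big = np.broadcast_to(row, (1_000_000, 3))  # looks like a million rows

print(big.shape)    # (1000000, 3)
print(row.nbytes)   # 24
print(big.strides)  # (0, 8): every "row" of big reuses row's 24 bytes
```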
Common Use Cases
- Normalization: Subtracting the mean of a dataset from each data point.
- Adding a Bias: In neural networks, adding a bias vector to each row of a weight matrix.
- Feature Scaling: Applying a scaling factor to each feature (column) in a dataset.
- Plotting Data: Creating a grid of x and y coordinates from 1D vectors to evaluate a function over a 2D plane.
# Example: Centering a dataset by subtracting its mean
import numpy as np

data = np.array([[1, 2, 3],
                 [4, 5, 6],
                 [7, 8, 9]])
# Mean of each column (shape (3,))
mean_per_feature = np.mean(data, axis=0)
print("Original Data:\n", data)
print("\nMean per feature:\n", mean_per_feature)
# Broadcasting the mean across all rows
centered_data = data - mean_per_feature
print("\nCentered Data:\n", centered_data)
# Notice each column now has a mean of 0 
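Finally, the "plotting" use case from the list above works the same way: give two 1D coordinate vectors orthogonal axes and broadcasting fills in the full 2D grid (the function here is just an arbitrary example):

```python
import numpy as np

x = np.linspace(0, 1, 5)  # shape (5,)
y = np.linspace(0, 1, 4)  # shape (4,)

# x as a (1, 5) row, y as a (4, 1) column -> broadcast to a (4, 5) grid
z = np.sin(x[np.newaxis, :]) + np.cos(y[:, np.newaxis])

print(z.shape)  # (4, 5)
```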