Design and Realization of an Artificial Intelligence Experimental Platform Using a Chinese Chess Robot

The pursuit of creating machines that can engage in and master complex games has long been a cornerstone of artificial intelligence research. As a domain that encapsulates strategic planning, decision-making under uncertainty, and adversarial reasoning, machine game-playing serves as an ideal testbed for advanced AI algorithms. Among the pantheon of strategic games, Chinese Chess (Xiangqi) stands out for its rich history and profound complexity, offering a unique and challenging environment for computational intelligence. This article details the comprehensive design and implementation of a physical, interactive Chinese Chess robot, positioning it not merely as a game-playing entity but as a versatile, multi-disciplinary experimental platform. The system, which we refer to as this China robot, integrates mechanics, electronics, computer vision, and sophisticated AI into a single cohesive unit, enabling tangible human-machine interaction over a physical board.

The core philosophy behind this China robot platform is embodied in its dual-system control architecture. This bifurcation separates high-level cognitive functions from low-level actuation tasks, mirroring a simplified paradigm of intelligent biological systems. The high-level cognitive tasks, which include perceiving the game state and determining the optimal strategic move, are handled by an upper computer system. This upper computer is typically a Raspberry Pi or a desktop PC, chosen for its sufficient computational power. Conversely, the precise physical manipulation of the chess pieces is delegated to a lower computer system. This division of labor allows each subsystem to be optimized for its specific role: the upper computer for complex algorithm execution and the lower computer for reliable, real-time motor control.

The operational workflow of the China robot is a continuous loop of perception, decision, and action. It begins with perception: a camera mounted directly above the board captures an image of the current game state. The upper computer processes this image using computer vision algorithms to identify the positions and types of all pieces. This perceived state is then fed into a Chinese Chess engine—a specialized program that embodies the game’s rules and strategic knowledge. The engine calculates the best possible move for the China robot. Once a move is selected (e.g., “move cannon from coordinate (2, 3) to (2, 7)”), the upper computer translates this abstract command into a series of low-level motion instructions. These instructions are sent via a serial communication protocol to the lower computer. The lower computer, in turn, drives the motors of the robotic mechanism to execute the move, physically picking up the designated piece and placing it on the target square. Throughout this process, the system provides auditory feedback via speech synthesis, announcing its moves and game events, thereby enhancing the user’s immersive experience.

Mechanical Design and Actuation System

The physical embodiment of the China robot is a Cartesian (or Gantry) robot. This design was selected due to the inherently planar and grid-based nature of the Chinese Chess board. A Cartesian robot provides motion along three orthogonal axes, labeled X, Y, and Z, which perfectly maps to the need for positioning over the board’s surface and performing vertical pick-and-place operations.

X and Y Axes (Planar Positioning): These axes are responsible for moving the end-effector to any square on the board. Precision is paramount here, as the system must reliably position itself over the center of a specific square, each typically measuring around 40-50mm. To achieve this, each axis is driven by a NEMA 17 (42mm) stepper motor. The motor rotates a synchronous pulley, which engages with a timing belt that moves the carriage along a linear rail. The high precision comes from the stepper motor’s ability to move in discrete, precise angular steps (e.g., 1.8° per full step). With microstepping drivers, even finer control is possible. The fundamental relationship between motor steps and linear displacement is given by:

$$ \Delta L = \frac{\pi \cdot D \cdot \theta}{360 \cdot N_{micro} \cdot R}$$

Where:

$\Delta L$ = Linear displacement per motor step (mm)
$D$ = Diameter of the synchronous pulley (mm)
$\theta$ = Stepper motor’s basic step angle (degrees, typically 1.8°)
$N_{micro}$ = Microstepping factor (e.g., 1 for full step, 16 for 1/16 microstep)
$R$ = Gear reduction ratio (if any)

For a pulley diameter of 22.3mm, a basic step angle of 1.8°, and a 1/16 microstepping driver, the resolution $\Delta L$ can be calculated, ensuring sub-millimeter accuracy.

Z Axis (Vertical Actuation): This axis controls the end-effector’s vertical movement for picking up and releasing pieces. The requirement here is less about ultra-high precision and more about sufficient force and reliable travel. A linear actuator (or “push-rod” DC motor) is employed. It provides a fixed, repeatable stroke (e.g., 10-15mm), which is adequate to lift a piece clear of others on the board. Attached to the end of this actuator is the end-effector itself: a miniature electromagnetic solenoid (electromagnet). When energized, the electromagnet generates a magnetic field strong enough to hold a metal-cored chess piece securely. When de-energized, the piece is released. This simple, non-contact grasping method is highly effective for the standardized pieces.

The mechanical design of this China robot ensures robustness, repeatability, and quiet operation, making it suitable for laboratory and demonstration environments.

Table 1: Key Mechanical Specifications of the China Robot
Component	Specification	Purpose/Rationale
Frame Structure	Aluminum Extrusion	Provides rigidity, lightweight, and ease of assembly.
X/Y Axis Drive	NEMA 17 Stepper Motor + Timing Belt	Ensures precise, backlash-free planar positioning.
Linear Motion	Precision Linear Rails & Bearings	Guarantees smooth, accurate, and low-friction movement.
Z Axis Actuator	12V DC Linear Actuator	Provides controlled vertical motion for pick-and-place.
End-Effector	Miniature Electromagnet	Simple, reliable method for grasping metal-cored pieces.
Positioning Resolution	< 0.5 mm	Sufficient to reliably center over any board square.

Hardware Circuit and Low-Level Control System

The “nervous system” of the China robot is its electronic control circuit, centered around a dedicated microcontroller unit (MCU) acting as the lower computer. The primary responsibilities of this subsystem are: interpreting high-level commands from the upper computer, generating precise pulse sequences to drive the stepper motors, controlling the power to the linear actuator and electromagnet, and managing system initialization procedures like homing.

The heart of the circuit is an STM32F103RCT6 microcontroller, an ARM Cortex-M3 based chip known for its balance of performance, peripheral set (multiple timers, USART, GPIO), and cost-effectiveness—a common choice in embedded systems for China robot projects. It runs a firmware program written in C, which constitutes the real-time control kernel.

The motor control is achieved through dedicated driver modules. For the X and Y stepper motors, A4988 or DRV8825 stepper driver modules are used. These modules accept simple STEP and DIRECTION (DIR) signals from the MCU. Each pulse on the STEP line commands the motor to move one microstep. The DIR line controls the rotation direction. The MCU’s programmable timers are configured to generate these pulse trains with precise frequencies, thereby controlling the speed of the axes. The current supplied to the motors (set by a potentiometer on the driver) determines the holding and dynamic torque. The Z-axis linear actuator, being a DC motor, is controlled via an H-bridge circuit or a motor driver module (like an L298N), allowing the MCU to apply voltage in either polarity to extend or retract the actuator, and to cut power to hold position.

The electromagnet is driven by a simple transistor switch (e.g., a MOSFET), as it only requires ON/OFF control. A flyback diode is critically placed in parallel with the electromagnet coil to suppress voltage spikes generated when the current is suddenly switched off, protecting the transistor and the MCU.

Communication with the upper computer is established via a universal asynchronous receiver-transmitter (UART) serial protocol, typically over USB-to-serial converters for convenience. A simple, robust communication protocol is defined. For example, a move command from the upper computer might be formatted as a string: "MOVE,X1,Y1,X2,Y2\n", where (X1,Y1) are the coordinates of the piece to move, and (X2,Y2) are the destination coordinates. The lower computer parses this string, plans the motion trajectory (often a simple point-to-point move), and executes it step-by-step, sending status updates (like “DONE” or “ERROR”) back to the upper computer upon completion.

Table 2: Core Hardware Modules of the Control System
Module	Core Component/IC	Primary Function in the China Robot
Main Controller	STM32F103RCT6 MCU	Central processing unit; executes control firmware, manages I/O and communication.
Stepper Driver	A4988 or DRV8825	Translates MCU step/dir signals into phased coil currents for X/Y stepper motors.
DC Motor Driver	L298N or TB6612FNG	Provides bidirectional current control for the Z-axis linear actuator.
Power Regulation	LM2596 (Switching Regulator)	Steps down input voltage (e.g., 12V) to stable 5V or 3.3V for MCU and logic circuits.
Serial Communication	CH340G (USB-UART Bridge)	Facilitates communication between the STM32 and the upper computer (PC/RPi).

Software Architecture: Vision, Intelligence, and Integration

The intelligence of the China robot resides in the software running on the upper computer. This software stack is multifaceted, integrating computer vision for perception, a game engine for cognition, and a control interface for action.

1. Board State Recognition via Computer Vision

The first task is to accurately perceive the physical board. Using the OpenCV library in Python, the system performs the following pipeline:

Image Acquisition & Preprocessing: A frame is captured from the USB camera. The image is undistorted using a pre-calibrated camera model to correct lens distortions. It is then cropped and perspective-transformed to obtain a top-down, warped view of just the chessboard area, simplifying all subsequent processing.
Color-Based Piece Detection: The board has two distinct piece colors: red and black (or blue). The image is converted from BGR to HSV color space, which is more robust to lighting variations. Using predefined HSV range thresholds, masks for “red” and “black” pixels are created.
$$ M_{red}(x,y) = \begin{cases} 1 & \text{if } H_{low,r} \leq H(x,y) \leq H_{high,r} \text{ and } S_{low,r} \leq S(x,y) \leq S_{high,r} \text{ and } V_{low,r} \leq V(x,y) \leq V_{high,r} \\ 0 & \text{otherwise} \end{cases} $$
A similar mask $M_{black}$ is created. Contours are found on these masks, and those with an area within an expected range are identified as potential pieces.
Grid Localization & Piece Positioning: To determine which square a piece occupies, the software must know the grid’s geometry. This can be done during an initial calibration step or by detecting the board’s outer lines and internal grid using the Hough Line Transform. The board is divided into a 9×10 logical grid. The centroid $(c_x, c_y)$ of each detected piece contour is computed and mapped to a grid coordinate $(i, j)$ using:
$$ i = \left\lfloor \frac{c_x – x_{grid\_origin}}{w_{cell}} \right\rfloor, \quad j = \left\lfloor \frac{c_y – y_{grid\_origin}}{h_{cell}} \right\rfloor $$
where $w_{cell}$ and $h_{cell}$ are the width and height of a single grid cell.
Change Detection: To identify the opponent’s move, the current perceived board state is compared with the previous known state. The difference pinpoints the “from” and “to” squares of the human player’s move, which is then translated into a standard notation and fed to the game engine.

Table 3: Comparison of Vision Processing Techniques for the China Robot
Technique	Application in This China Robot	Advantages	Challenges
HSV Color Segmentation	Distinguishing red and black pieces.	Robust to brightness changes; simple to implement.	Requires careful threshold tuning; sensitive to strong colored ambient light.
Contour Analysis	Finding individual pieces from color masks.	Provides precise centroid and area.	Can fail if pieces are touching or if lighting creates shadows that merge contours.
Hough Circle Transform	Alternative method for detecting round pieces.	Geometrically precise.	Computationally heavier; parameter-sensitive (min/max radius).
Template Matching	Could be used for piece type recognition (General, Horse, etc.).	Potentially high accuracy for type ID.	Requires multiple templates; sensitive to scale and rotation; not implemented in basic version.

2. The Chinese Chess Engine: The AI Core

The game-playing intelligence is encapsulated in a Chinese Chess engine, typically written in C++ for performance. Its architecture consists of four fundamental pillars:

Board Representation: How the game state is stored in memory. A common efficient method is the “mailbox” array (a 10×9 or 1×90 array) where each element corresponds to a board square and holds a code for the piece occupying it (e.g., RED_ROOK, BLACK_HORSE, EMPTY). Bitboards, while prevalent in Western chess, are less common in Xiangqi due to the board’s geometry and the river/castle rules, but advanced engines may use specialized variants.
Move Generation: A function that, given a board position, produces a list of all legal moves for the side to move. This is rule-based, involving checking the unique movement patterns of each piece type (e.g., the Elephant moves exactly two points diagonally and cannot cross the river), ensuring the General is not left in check, and adhering to rules like perpetual check restrictions.
Position Evaluation: A function $f(P)$ that assigns a numerical score to any board position $P$, estimating which side is ahead and by how much. It is a weighted sum of heuristic features:
$$ f(P) = w_1 \cdot (Material_{red} – Material_{black}) + w_2 \cdot (Mobility_{red} – Mobility_{black}) + w_3 \cdot (KingSafety_{red} – KingSafety_{black}) + … $$
Here, $w_i$ are weights learned through self-play or manual tuning. Material values are fundamental (e.g., Chariot=900, Cannon=450, Horse=400). The evaluation function is the engine’s “intuition.”
Search Algorithm: The mechanism that explores future moves to find the best one. The Minimax algorithm is the foundation, assuming the opponent will play optimally to minimize the engine’s score. The depth of the search tree is crucial. A naive Minimax search explores $b^d$ nodes, where $b$ is the average branching factor (~40 in Xiangqi) and $d$ is the search depth. This explodes exponentially. Therefore, the Alpha-Beta Pruning algorithm is employed. It achieves the same result as Minimax but prunes away branches that cannot possibly influence the final decision, drastically reducing the number of nodes evaluated. The performance gain can be quantified. Let the number of nodes examined by Minimax be $N_{minimax} = b^d$. With perfect move ordering, Alpha-Beta examines approximately $N_{\alpha\beta} \approx b^{d/2}$ nodes. For $b=40$ and $d=6$:
$$ N_{minimax} \approx 40^6 = 4.096 \times 10^9 \text{ nodes} $$
$$ N_{\alpha\beta} \approx 40^{3} = 6.4 \times 10^4 \text{ nodes} $$
This represents a speedup factor of about 64,000, making deep searches feasible. The search is often enhanced with techniques like iterative deepening, transposition tables (to cache results of previously seen positions), and quiescence search (to avoid horizon effect).

The engine’s strength in this China robot is directly controlled by the search depth parameter $d$. Setting $d=4$ results in a relatively quick, weaker “beginner” mode, while $d=8$ or higher produces a much stronger “expert” level play, albeit with longer thinking time. This configurability is a key feature of the platform.

Table 4: Core Components of the Chinese Chess Engine
Component	Description & Implementation	Mathematical/Algorithmic Basis
Board Representation	“Mailbox” 10×9 array. Each cell stores an integer code for the piece (e.g., 1 for Red Chariot, -5 for Black Cannon).	$board[9][10] \in \mathbb{Z}$
Move Generation	Rule-based functions for each piece type, checking legality (bounds, blockers, check). Returns a list of (from_sq, to_sq) pairs.	Procedural logic based on Xiangqi rules.
Evaluation Function	Weighted sum of material, piece activity, king safety, pawn structure, and space control.	$Eval(P) = \sum_{i} w_i \cdot f_i(P)$; $f_i$ are feature functions, $w_i$ are weights.
Search Algorithm	Alpha-Beta Pruning with iterative deepening and transposition tables.	$\text{alphaBeta}(depth, \alpha, \beta) = \max/\min(\text{eval}, \text{alphaBeta}(depth-1, …))$. Prunes if $\alpha \geq \beta$.
Performance Metric	Nodes Per Second (NPS), search depth reached, and playing strength (Elo rating).	Higher NPS and deeper effective search depth correlate strongly with higher Elo.

3. System Integration and Control Flow

The upper computer software, potentially written in Python, acts as the maestro, orchestrating all components. Its main loop follows this sequence:

1.  Initialize: Establish serial connection with STM32, load opening book, calibrate camera if needed.
2.  Recognize initial board setup or wait for human first move.
3.  Loop until game ends:
    a.  **Human Turn:** 
        i.   Use CV to continuously monitor board.
        ii.  Detect state change -> deduce human move.
        iii. Validate move legality via engine.
        iv.  Update engine's internal board.
    b.  **AI Turn (China Robot's move):**
        i.   Engine performs search from current position.
        ii.  Selects best move (source_sq, dest_sq).
        iii. Convert board coordinates to physical X,Y coordinates (mm).
        iv.  Send command string to STM32: e.g., `"PICK,x1,y1\n"`.
        v.   Wait for "DONE" acknowledgement.
        vi.  Send command: `"PLACE,x2,y2\n"`.
        vii. Update engine's internal board and GUI.
        viii. Use text-to-speech to announce move.
4.  Announce game result (Checkmate, Draw, etc.).

The firmware on the STM32 interprets the “PICK” and “PLACE” commands. For “PICK,x,y”, it:
1. Plans a trajectory to move the end-effector to (x,y).
2. Lowers the Z-axis.
3. Activates the electromagnet.
4. Raises the Z-axis.
5. Returns to a designated “park” position to avoid collisions.
6. Sends “DONE”.

The “park and reset” strategy is critical for mitigating cumulative positioning errors inherent in open-loop stepper motor systems. By returning to a fixed, known reference point after each operation, any small error from one move is not carried forward to the next.

The China Robot as a Multifaceted AI Experimental Platform

Beyond playing chess, the true value of this integrated China robot system lies in its utility as a comprehensive, hands-on experimental platform for education and research in Artificial Intelligence and related fields.

AI & Machine Learning:
- Algorithm Testing: Students can modify the evaluation function weights ($w_i$) or inject new positional features ($f_i$) and immediately observe the impact on playing strength through physical games.
- Reinforcement Learning (RL): The platform is ideal for RL experiments. The engine can be replaced by an RL agent (e.g., using a neural network policy). The agent learns by playing thousands of games against itself or a baseline engine, with the physical robot providing a compelling demonstration of the learned policy. The state $s_t$ is the board representation, the action $a_t$ is the move, and the reward $r_t$ could be +1 for win, -1 for loss, 0 for draw, plus small material-based rewards.
- Search Algorithm Studies: Different search algorithms (Minimax, Alpha-Beta, Monte Carlo Tree Search – MCTS) can be implemented and compared in terms of nodes visited, speed, and resulting move quality.
Computer Vision:
- Robust Recognition: The challenge of reliable piece recognition under varying lighting conditions is perennial. Students can experiment with advanced techniques beyond basic HSV filtering: machine learning-based classifiers (SVM, CNN) for piece type recognition, background subtraction for change detection, or more sophisticated color constancy algorithms.
- Calibration & Geometry: Implementing and comparing different camera calibration methods (checkerboard vs. circular grid) and perspective correction algorithms provides practical insight into geometric computer vision.
Robotics & Control Systems:
- Motion Planning & Control: The simple point-to-point movement can be extended to implement trajectory planning (e.g., S-curve velocity profiles for smoother, faster motion), closed-loop control using encoder feedback on the motors (transforming it from open-loop to closed-loop), or even obstacle avoidance logic in a cluttered environment.
- System Integration: The project embodies a complete cyber-physical system, teaching the integration of sensors (camera), actuators (motors, electromagnet), communication protocols (UART), and real-time control software.
Human-Computer Interaction (HCI):
- The platform can be augmented with voice control modules. Implementing a speech recognition system to accept verbal moves (“Cannon two to five”) or commands (“New game”, “Take back move”) turns the China robot into a multimodal interaction study platform.
- The auditory feedback system can be made more interactive, providing hints or explaining the robot’s strategic reasoning.

Conclusion

The design and implementation of this Chinese Chess robot culminate in a sophisticated, yet accessible, platform that bridges the gap between abstract AI algorithms and tangible physical interaction. This China robot successfully integrates mechatronic design, embedded systems control, real-time computer vision, and advanced game-playing artificial intelligence into a single functional unit. Its dual-system architecture ensures a clean separation of concerns, promoting both stability and modularity for experimentation. The use of the Alpha-Beta pruning search algorithm within the chess engine provides a computationally efficient core that enables adjustable playing strength, making the platform engaging for users of all skill levels.

More importantly, the system transcends its primary function as a game player. It serves as a rich, multi-disciplinary experimental platform. It allows students and researchers to empirically test and visualize concepts from decision theory, search algorithms, machine learning (particularly reinforcement learning), image processing, robotic motion control, and system integration. The immediate physical feedback—seeing a robot arm execute a strategically calculated move—provides an unparalleled depth of understanding and engagement. Therefore, this China robot stands as a compelling testament to the integrative nature of modern intelligent systems and offers a powerful tool for advancing education and innovation in the field of artificial intelligence and robotics.