Zhou Bojian 周柏健 — ML for Business

Four featured projects across NLP/finance, time-series forecasting, reinforcement learning, and combinatorial optimization. 四个精选项目，覆盖 NLP/金融、时序预测、强化学习与组合优化。

FinBERT Fine-Tuning for Financial SentimentFinBERT 金融情感分析微调

CDS525 · Mar–Apr 2026

PyTorchTransformersFinBERTGrid Search

Problem问题Financial texts (earnings reports, analyst notes) use domain-specific language that generic sentiment models misclassify — a real cost for any analyst or investor relying on automated signals.金融文本（财报、研报）高度依赖领域词汇，通用情感模型误判率高 —— 对依赖自动化信号的分析师或投资者而言代价不菲。

Data数据Financial PhraseBank (~4.8k sentences, 3-class sentiment). Handled moderate class imbalance with weighted cross-entropy and focal loss.Financial PhraseBank 数据集（约 4800 句，3 类情感）。使用加权交叉熵与 Focal Loss 处理类别不平衡。

Approach方法Full factorial grid search: 2 loss functions × 4 batch sizes × 4 learning rates = 32 training runs, all tracked in a unified notebook with reproducible seeds.完整析因网格搜索：2 种损失函数 × 4 组 batch size × 4 组学习率 = 32 次训练，统一 notebook 可复现。

Outcome成果Best macro F1: 89.92% (Weighted CE, BS=32, LR=2e-5). Delivered unified training notebook, ~2,664-word Word report, and GitHub README.最佳 Macro F1：89.92%（加权交叉熵, BS=32, LR=2e-5）。交付统一训练 notebook、约 2664 字 Word 报告与 GitHub README。

My Role我的贡献Group project — I designed the experiment grid, implemented the unified training loop, ran all 32 configurations, and authored the analysis.小组项目 —— 我负责实验设计、统一训练循环实现、执行全部 32 组实验并撰写分析报告。

GitHub →

NAS-EA + DQN for Vehicle Routing with Time Windows基于 NAS-EA + DQN 的带时间窗车辆路径优化

CDS526 · Feb–Apr 2026

Deep RLPyTorchNeural Architecture SearchEvolutionary Algorithms

Problem问题Last-mile delivery with time-window constraints is NP-hard. Classical solvers are exact but slow; naive RL agents pick infeasible actions and train unstably.带时间窗的最后一公里配送是 NP 难问题。传统求解器精确但慢；朴素强化学习智能体频繁选择不可行动作，训练不稳定。

Data数据Solomon VRPTW benchmark instances. Customer locations, demands, time windows, and service durations.Solomon VRPTW 基准数据集：客户坐标、需求、时间窗与服务时长。

Approach方法Combined Neural Architecture Search via evolutionary algorithm with a Dueling DQN agent. Implemented action masking so infeasible actions get Q=−∞, and diagnosed depot-reset and time-window feasibility bugs across v6→v7→v8.结合进化算法驱动的神经架构搜索与 Dueling DQN。引入动作掩码使不可行动作 Q=−∞，并在 v6→v7→v8 过程中修复了 depot 重置与时间窗可行性缺陷。

Outcome成果v7 achieved mean distance 731 with 18% CV, vs v6's 948 / 77% CV — a 23% improvement in solution quality and 4× improvement in stability across 5 seeds.v7 取得平均路径长度 731、变异系数 18%，相较 v6（948 / 77%）解的质量提升 23%，稳定性提升 4 倍（5 个随机种子）。

My Role我的贡献Individual research project. Designed NAS search space, implemented masking, ran multi-seed experiments, and wrote the report.个人研究项目。负责架构搜索空间设计、掩码机制实现、多种子实验与报告撰写。

GitHub →

European Weather Forecasting: XGBoost vs BiLSTM欧洲天气预测：XGBoost 对比 BiLSTM

CDS524 · 2025–2026

XGBoostBiLSTM + AttentionTime SeriesFeature Engineering

Problem问题Temperature forecasting benchmarks usually pit tree models against deep learning with standard lag features. But can domain-informed features — in this case Chinese cosmological calendar features (24 Solar Terms, Wuxing, Heavenly Stems) — actually shift the balance? And do the two model families respond to them the same way?温度预测的常见评测是用标准滞后特征对比树模型和深度学习。但领域启发式特征 —— 这里指中国历法体系（二十四节气、五行、天干） —— 是否真能改变两者的表现平衡？两类模型对这类特征的响应是否一致？

Data数据European daily weather records with temperature, humidity, pressure, and wind features. Aligned to the Chinese solar calendar for feature engineering.欧洲日度气象数据：温度、湿度、气压、风速等。与中国农历节气体系对齐后构造特征。

Approach方法Clean 2×2 factorial design: {XGBoost, BiLSTM+Attention} × {standard lag/rolling features, Chinese cosmological features}. Statistical significance tested with paired t-tests across multiple seeds.清晰的 2×2 析因设计：{XGBoost, BiLSTM+Attention} × {标准滞后/滚动特征, 中国历法特征}。多随机种子配对 t 检验评估显著性。

Outcome成果Non-obvious finding: Chinese calendar features significantly helped XGBoost without lag features (p=0.0005) but harmed LSTM performance — suggesting the two architectures treat cyclical priors very differently. Delivered full report, GitHub README, and a 20-minute narrated PPT.非直观结论：中国历法特征对不使用滞后特征的 XGBoost 显著提升（p=0.0005），但损害 LSTM 表现 —— 表明两类架构对周期性先验的利用方式截然不同。交付完整报告、GitHub README 与 20 分钟讲稿的 PPT。

My Role我的贡献Group project. I designed the factorial experiment, engineered the Chinese-calendar feature set, built both pipelines, and ran the statistical analysis.小组项目。我负责析因实验设计、中国历法特征工程、两类流水线实现以及统计显著性分析。

GitHub →

Space Defender: Dueling DQN + Prioritized Experience ReplaySpace Defender：Dueling DQN + 优先经验回放

CDS524 · 2025

Deep RLDueling DQNPERCurriculum LearningReward Shaping

Problem问题A standard RL benchmark game — but a good testbed for a real-world issue: agents that exploit reward loopholes (e.g., "corner-hiding" to maximize survival time without engaging) rather than learning the intended behavior. How do you get stable, high-scoring, and honest play?这是一个标准强化学习基准游戏，同时是现实问题的良好载体：智能体往往会钻奖励漏洞（例如"躲角落"来延长存活时间而不参与对抗），而非学习真正期望的行为。如何训练出稳定、高分且行为符合预期的策略？

Data数据Self-generated from a custom Space Defender game environment — state, action, reward, next-state tuples sampled during rollouts across 8 training iterations.自定义 Space Defender 游戏环境自产生数据 —— 8 个训练迭代中采样的 (状态, 动作, 奖励, 下一状态) 经验元组。

Approach方法Dueling DQN with Prioritized Experience Replay, iterated v1→v8. Diagnosed reward-hacking (corner-hiding), redesigned the reward function, and introduced curriculum learning in v8 to progressively harden the enemy spawn pattern.Dueling DQN 结合优先经验回放（PER），迭代 v1→v8。诊断出奖励黑客行为（躲角落），重新设计奖励函数，并在 v8 中引入课程学习，逐步加大敌方生成难度。

Outcome成果Reached mean score ~1,960 (target 3,500). More importantly, the iteration log documents how each architectural and reward-shaping change moved the needle — a realistic record of RL debugging, not just a final number.最终平均得分约 1960（目标 3500）。更重要的是迭代日志完整记录了每次架构与奖励调整的效果 —— 真实呈现强化学习调试过程，而不只是给出一个终值。

My Role我的贡献Individual project. Full ownership: environment, agent architecture, PER implementation, reward design, and 8-version iteration.个人项目。独立完成：环境搭建、智能体架构、PER 实现、奖励设计与 8 版本迭代。

GitHub →

Turning data into business decisions with modern ML. 用现代机器学习，把数据转化为商业决策。

About Me关于我

Focus Areas专注方向

Education教育背景

Certifications认证

Languages语言

Skills技能

Technical技术能力

Business商业能力

Tools & Stack工具与栈

Projects项目

FinBERT Fine-Tuning for Financial SentimentFinBERT 金融情感分析微调

NAS-EA + DQN for Vehicle Routing with Time Windows基于 NAS-EA + DQN 的带时间窗车辆路径优化

European Weather Forecasting: XGBoost vs BiLSTM欧洲天气预测：XGBoost 对比 BiLSTM

Space Defender: Dueling DQN + Prioritized Experience ReplaySpace Defender：Dueling DQN + 优先经验回放

Resume简历

Education教育

Experience工作经历

Selected Projects精选项目

Certifications & Languages认证与语言

Contact联系方式