FX Market Edge Prediction

Rigorous Machine Learning Research in Foreign Exchange Markets

← Back to Portfolio 🔗 View on GitHub

Project Overview

This research project represents a comprehensive investigation into the predictability of USD pip movements in foreign exchange markets using state-of-the-art machine learning techniques. The study employed a hybrid architecture combining Temporal Convolutional Autoencoders (TCNAE) with LightGBM gradient boosting to analyze 3 years of hourly market data across 24 major currency pairs.

Key Finding: Despite sophisticated methodology and rigorous experimental design, no statistically significant market edge was discovered, providing valuable negative evidence that validates the efficient market hypothesis for hourly FX movements.
Python PyTorch LightGBM OANDA v20 API Docker Pandas/NumPy Temporal CNNs Financial ML

Technical Architecture

System Architecture Diagram

Complete 3-stage ML pipeline: TCNAE autoencoder → LightGBM models → USD predictions

Core Components

Dataset & Methodology

224,955
Total Samples
24
Currency Pairs
3 Years
Historical Data
32,880
Test Predictions

Data Sources & Quality

Results & Performance

Performance Analysis Results

Comprehensive performance analysis showing no statistically significant edges discovered

0.02%
Average Correlation
50.1%
Direction Accuracy
$101.80
Average RMSE
5.83%
Best Correlation (USD_CAD)

Key Findings

  • Direction accuracy clustered around 50% (random baseline) across all instruments
  • Correlation coefficients remained below 10% for all currency pairs
  • Both log returns and direct USD pip training approaches converged to identical conclusions
  • Model uncertainty appropriately reflected market unpredictability

Technical Innovation

Novel Contributions

Methodological Rigor

Scientific Value

Research Contribution

This study provides crucial negative evidence using modern ML techniques, serving as a methodological template for rigorous financial prediction research. The honest reporting of unsuccessful edge discovery attempts contributes valuable knowledge to the field by demonstrating that sophisticated technical approaches cannot overcome fundamental market efficiency.

Industry Implications

Technical Skills Demonstrated

Deep Learning Architecture Financial Data Engineering Production ML Pipelines Statistical Validation Time Series Analysis API Integration Docker Containerization Scientific Computing Research Methodology Code Quality & Testing

Repository Access

The complete codebase, experimental results, and documentation are available on GitHub. The repository includes trained models, comprehensive analysis reports, and visualization tools for full reproducibility of the research findings.

🔗 Explore the Complete Repository

Repository Features: Production-ready code, comprehensive documentation, experimental results, trained models, visualization suite, and Docker deployment configuration.