Claude vs GPT-5 vs Gemini: Live Gold Trading Experiment Week 1 – My Trading – 7 October 2025


Auto-posted while I'm in Tokyo. Running these tests 24/7 on a VPS.

I've been running the same Gold trading prompts through three different AI models for a week. Same account, same expert advisor (DoIt Alpha Pulse AI), completely different thinking patterns.

Here's what’s actually happening with Claude, GPT-5, and Gemini when they analyze Gold.

The Test Setup (You Can Replicate This)

The Exact Prompt I'm Using

Current XAUUSD: [price]
Last 3 H1 candles: [data]
Session: [London/NY/Asian]
News today: [economic calendar]
Should I: Buy/Sell/Hold?
Risk: 0.5% max
Target: Risk-reward 1:2 minimum
Explain reasoning in 50 words max.

Simple. Clear. Same for all three models.
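If you want to fill that template from a script, here is a minimal Python sketch. The function name, the candle tuple layout (open, high, low, close), and the validation rules are assumptions made for illustration; this is not how DoIt Alpha Pulse AI builds its requests.

```python
# Hypothetical template mirroring the prompt text above.
PROMPT_TEMPLATE = (
    "Current XAUUSD: {price}\n"
    "Last 3 H1 candles: {candles}\n"
    "Session: {session}\n"
    "News today: {news}\n"
    "Should I: Buy/Sell/Hold?\n"
    "Risk: 0.5% max\n"
    "Target: Risk-reward 1:2 minimum\n"
    "Explain reasoning in 50 words max."
)

VALID_SESSIONS = {"London", "NY", "Asian"}


def build_prompt(price, candles, session, news):
    """Return the filled prompt text.

    candles: exactly three (open, high, low, close) tuples, oldest first.
    The tuple layout is an assumption; the post only says "[data]".
    """
    if session not in VALID_SESSIONS:
        raise ValueError(f"unknown session: {session!r}")
    if len(candles) != 3:
        raise ValueError("expected exactly 3 H1 candles")
    candle_text = "; ".join(
        f"O={o:.2f} H={h:.2f} L={l:.2f} C={c:.2f}" for o, h, l, c in candles
    )
    return PROMPT_TEMPLATE.format(
        price=f"{price:.2f}", candles=candle_text, session=session, news=news
    )
```

Because every model receives the exact string this returns, any difference in the answers comes from the model rather than from the wording.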

Testing Conditions

  • Demo account: $5000
  • Each model gets: $1500 allocation
  • Same trades offered: All three see identical setups
  • Decision tracked: Even when they say “Hold”
  • Time recorded: Response speed matters
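To put the prompt's 0.5% risk cap and 1:2 target into dollars, here is a small Python sketch. It assumes the percentage applies to each model's $1500 allocation rather than to the whole $5000 demo account, which the setup does not spell out.

```python
def risk_budget(allocation, risk_pct, reward_multiple=2.0):
    """Return (max loss, minimum target profit) in account currency.

    Assumption: risk_pct is a percentage of the per-model allocation.
    """
    risk = allocation * risk_pct / 100.0
    return risk, risk * reward_multiple


max_loss, min_target = risk_budget(1500, 0.5)   # 7.5 and 15.0
reduced_loss, _ = risk_budget(1500, 0.25)       # 3.75 at a halved risk setting
```

Under that assumption each trade risks at most $7.50 to aim for at least $15.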

Early Observations (Not Conclusions)

GPT-5: The Overthinker

Response time: 3-5 seconds

GPT-5 keeps finding patterns that might not exist. Yesterday it said:

“The three-candle formation resembles the May 2023 reversal pattern combined with current DXY weakness suggesting institutional accumulation however the volume profile indicates…”

Problem: By the time it finishes thinking, the entry is gone.

Interesting behavior: It catches subtle correlations. It noticed that Gold was ignoring Dollar strength because bond yields were also rising. That's actually sophisticated.

Current status:

  • Signals generated: 12
  • Trades taken: 4 (others too slow)
  • Win rate: 50% (2 wins, 2 losses)
  • P&L: +45 pips

Claude Opus 4.1: The Speed Trader

Response time: 1-2 seconds

Claude makes decisions FAST. Sometimes too fast. Its responses are like:

“Bullish. London open + support held + Dollar weak. Buy.”

Strength: In fast markets, Claude actually gets fills. During Wednesday’s volatility, it was the only model that caught the reversal.

Weakness: Less nuanced. Missed the Bond/Gold correlation completely.

Current status:

  • Signals generated: 18
  • Trades taken: 11
  • Win rate: 54% (6 wins, 5 losses)
  • P&L: +72 pips

Gemini 2.5: The Conservative One

Response time: 2-4 seconds (varies)

Gemini is more cautious. Sometimes it passes on trades the others take. Tuesday it said:

“No clear edge. Suggest waiting for better setup.”

This happens more with Gemini than GPT or Claude.

Unexpected strength: Risk management. When uncertain, it often suggests smaller positions. It's the only model that regularly says “reduce risk to 0.25%” when confidence is lower.

Minor weakness: Sometimes TOO conservative, missing good moves while waiting for “perfect” setups.

Current status:

  • Signals generated: 9
  • Trades taken: 5
  • Win rate: 60% (3 wins, 2 losses)
  • P&L: +38 pips
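The three status blocks are easier to compare side by side. The Python sketch below only re-derives ratios from the week 1 numbers reported above; the dictionary layout and the "take rate" label are my own naming, not something the EA reports.

```python
# Week 1 figures copied from the status lists above.
WEEK1 = {
    "GPT-5":  {"signals": 12, "trades": 4,  "wins": 2, "pips": 45},
    "Claude": {"signals": 18, "trades": 11, "wins": 6, "pips": 72},
    "Gemini": {"signals": 9,  "trades": 5,  "wins": 3, "pips": 38},
}


def summarize(stats):
    """Derive simple ratios from one model's raw counts."""
    return {
        # share of taken trades that won
        "win_rate_pct": 100 * stats["wins"] / stats["trades"],
        # share of generated signals that became trades
        "take_rate_pct": 100 * stats["trades"] / stats["signals"],
        # average net pips per trade taken
        "pips_per_trade": stats["pips"] / stats["trades"],
    }


summary = {model: summarize(stats) for model, stats in WEEK1.items()}
```

On these numbers GPT-5 averages 11.25 pips per trade taken, Gemini 7.6, and Claude about 6.5, while Claude's 6 wins from 11 trades works out to roughly 54.5%. With only 4 to 11 trades per model, none of these ratios is statistically meaningful yet.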

The Interesting Discovery: They Sometimes Disagree

Most of the time, they agree on direction. But here's what happened Thursday at London open:

Gold price: 1952.30
Setup: Break above Asian high

  • GPT-5: “Wait for pullback to 1950”
  • Claude: “Buy now, momentum building”
  • Gemini: “Buy but smaller position”

Same bullish bias, different approaches to entry.

Claude entered immediately. Gold ran to 1958. Claude got the best entry.
But all three would have been profitable – just different amounts.

What’s Actually Valuable Here

Speed vs Intelligence Trade-off

  • Need fast decisions? Claude
  • Need deep analysis? GPT-5
  • Need risk management? Gemini (surprisingly)

Cost Per Decision (This Week)

  • GPT-5: $0.12 average
  • Claude: $0.08 average
  • Gemini: $0.06 average

Claude is 33% cheaper than GPT-5 AND faster. But GPT-5’s two wins were bigger (+40 and +35 pips vs Claude’s average of +20).
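The percentage comparison is simple arithmetic on the averages listed above. A small Python sketch, where the helper name is hypothetical:

```python
# Average cost per decision in USD, as listed above.
COST_PER_DECISION = {"GPT-5": 0.12, "Claude": 0.08, "Gemini": 0.06}


def pct_cheaper(model, baseline):
    """How much cheaper `model` is than `baseline`, as a percentage."""
    base = COST_PER_DECISION[baseline]
    return 100 * (base - COST_PER_DECISION[model]) / base


claude_vs_gpt5 = pct_cheaper("Claude", "GPT-5")    # about 33.3
gemini_vs_gpt5 = pct_cheaper("Gemini", "GPT-5")    # about 50
gemini_vs_claude = pct_cheaper("Gemini", "Claude") # about 25
```

So Gemini is about half the price of GPT-5 per decision and about a quarter cheaper than Claude.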

The “Confidence” Problem

None of these models say “I don't know” enough. They always have an opinion, even when they shouldn't.

I'm testing adding this to prompts:

If unclear, say "No edge - skip this setup"
Confidence required: 70% minimum

Early results: 40% fewer signals, but a better win rate.
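One way to wire that guard into a script is to append the two lines to every prompt and treat the requested phrase as a skip signal. This is a minimal Python sketch; the function names and the plain substring check are assumptions, and it does not verify the model's 70% confidence claim in any way.

```python
# The two guard lines quoted above.
CONFIDENCE_GUARD = (
    'If unclear, say "No edge - skip this setup"\n'
    "Confidence required: 70% minimum"
)

# Assumed marker: the start of the exact phrase the prompt asks for.
SKIP_MARKER = "no edge"


def with_guard(prompt):
    """Append the confidence guard to an existing prompt."""
    return prompt.rstrip() + "\n" + CONFIDENCE_GUARD


def is_skip(reply):
    """True when the reply contains the requested skip phrase."""
    return SKIP_MARKER in reply.lower()
```

Matching on the exact requested phrase keeps the check predictable: a looser reply such as “No clear edge” would not count as a skip here and would need its own rule.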

The Framework That's Emerging

After one week, here's what I'm learning:

Use Claude When:

  • News is about to hit (speed matters)
  • London/NY session opens (momentum trades)
  • You need quick decisions on clear setups

Use GPT-5 When:

  • Asian session (more time to think)
  • Complex correlations matter
  • You can wait for perfect entries

Use Gemini When:

  • You want a second opinion
  • Risk management is the priority
  • Testing new strategies (it's more conservative)

What’s Actually Working Well

Smooth Operations

One thing that surprised me – DoIt Alpha Pulse AI handles all three models without issues:

  • No API errors (proper error handling built in)
  • No rate limit problems (intelligent request management)
  • Consistent connections across all models

That's actually our competitive advantage. While others struggle with integration, we just… trade.

The Real Differences Are Subtle

The models are more similar than different. They all:

  • Catch basic support/resistance
  • Understand trend direction
  • React to major news

The differences are in style, not substance:

  • Claude: Direct and fast
  • GPT-5: Detailed and thoughtful
  • Gemini: Cautious and measured

The “Explanation Tax”

Asking for reasoning adds:

  • 1-2 seconds to response time
  • 2x the token cost
  • Sometimes overthinking simple setups

But it’s worth it for learning what the AI “sees”.

What I'm Testing Next Week

Experiment 1: Consensus Trading

Only take trades where 2 of 3 models agree. Theory: Higher conviction setups.
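The 2-of-3 rule can be sketched in a few lines of Python. This assumes each model's reply has already been reduced to "Buy", "Sell", or "Hold"; how you extract that from the reply text is left out, and the function name is hypothetical.

```python
from collections import Counter


def consensus(decisions, min_agree=2):
    """Return "Buy" or "Sell" if enough models agree, otherwise "Hold".

    decisions: mapping of model name to "Buy", "Sell", or "Hold".
    "Hold" answers never count toward a trade.
    """
    votes = Counter(
        d.strip().capitalize()
        for d in decisions.values()
        if d.strip().capitalize() in ("Buy", "Sell")
    )
    if not votes:
        return "Hold"
    action, count = votes.most_common(1)[0]
    return action if count >= min_agree else "Hold"
```

For example, `consensus({"GPT-5": "Hold", "Claude": "Buy", "Gemini": "Buy"})` returns `"Buy"`, while one Buy, one Sell, and one Hold returns `"Hold"`.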

Experiment 2: Time-Based Rotation

  • Asian: Gemini (conservative for quiet markets)
  • London: Claude (speed for breakouts)
  • NY: GPT-5 (complexity of US session)
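A rotation like this comes down to a lookup from the current session to a model. In the Python sketch below the session boundaries (London 07:00-12:00 UTC, NY 12:00-21:00 UTC, Asian otherwise) are assumptions chosen for illustration; the plan above does not define them, and real session hours shift with daylight saving.

```python
from datetime import datetime, timezone

# Session-to-model assignment from the rotation plan above.
SESSION_MODEL = {"Asian": "Gemini", "London": "Claude", "NY": "GPT-5"}


def session_for_hour(hour_utc):
    """Map a UTC hour (0-23) to a session name, using assumed boundaries."""
    if not 0 <= hour_utc <= 23:
        raise ValueError("hour_utc must be in 0..23")
    if 7 <= hour_utc < 12:
        return "London"
    if 12 <= hour_utc < 21:
        return "NY"
    return "Asian"


def model_for(timestamp):
    """Pick the model for a timezone-aware datetime."""
    if timestamp.tzinfo is None:
        raise ValueError("timestamp must be timezone-aware")
    hour_utc = timestamp.astimezone(timezone.utc).hour
    return SESSION_MODEL[session_for_hour(hour_utc)]
```

Keeping the boundaries in one function makes it easy to shift them once the first week of rotation data shows where each model actually does better.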

Experiment 3: Specialized Prompts

Instead of one prompt for all, optimize for each model’s strengths:

  • Claude: Short, action-focused
  • GPT-5: Include correlation analysis
  • Gemini: Add risk parameters

The Honest Reality

After one week of parallel testing, the models perform similarly on Gold trading.

They all catch the obvious moves. The differences are marginal – maybe 5-10% performance variance. The skill isn't picking the “right” AI – it's writing better prompts.

That's why DoIt Alpha Pulse AI supports all of them. Not as a gimmick, but because different market conditions need different kinds of thinking.

Your Homework While I'm in Japan

If you have DoIt Alpha Pulse AI, try this:

  1. Run the same setup through different models
  2. Document when they disagree
  3. Track which one was right
  4. Share findings

By the time I'm back, we’ll have crowd-sourced data on which model works best for what.

The Questions I'm Investigating in Tokyo

Meeting with quant traders here who’ve been using AI longer:

  1. How do they handle model disagreement?
  2. What’s their approach to consensus?
  3. How do they optimize for latency from Asia?
  4. Are there models we’re not considering?

Current Scoreboard (Week 1)

Speed Champion: Claude (1-2 seconds)
Accuracy Leader: Gemini (60% win rate, but a small sample)
Complexity Master: GPT-5 (catches subtle patterns)
Cost Winner: Gemini ($0.06/decision)
Reliability: Claude (most consistent)

But remember – this is one week of data. Not conclusions, just observations.

The Real Value of This Experiment

It's not about finding the “best” model. It's about understanding that AI trading strategy isn't one-size-fits-all.

Your trading style, the pairs you trade, your risk tolerance – they all affect which AI model suits you.

That's why the prompt is more important than the model. A great prompt on Claude beats a bad prompt on GPT-5 every time.

Want to run your own AI model experiments?

Get DoIt Alpha Pulse AI – Now $397

Supports all major AI models. Switch between them instantly. Find what works for YOUR trading.

P.S. – Still in Tokyo. These models are running 24/7 on my VPS. When I check in from my hotel, I see Claude and GPT-5 arguing about whether 1958 is resistance or support. Even AIs can't agree on basic TA.

P.P.S. – If you’re testing models yourself, document everything. The patterns only emerge with data, not hunches.


