Singularity

134 readers
1 users here now

Everything pertaining to the technological singularity and related topics, e.g. AI, human enhancement, etc.

founded 2 years ago
MODERATORS
1
 
 
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/singularity by /u/IlustriousTea on 2025-01-31 22:48:13+00:00.

2
 
 
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/singularity by /u/YakFull8300 on 2025-01-31 22:18:26+00:00.

3
 
 
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/singularity by /u/Charuru on 2025-01-31 20:30:44+00:00.

4
 
 
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/singularity by /u/nsshing on 2025-01-31 20:32:27+00:00.

5
 
 
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/singularity by /u/Awkward-Raisin4861 on 2025-01-31 20:30:10+00:00.

6
 
 
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/singularity by /u/FosterKittenPurrs on 2025-01-31 20:22:12+00:00.


I made a simple python script to play Minesweeper via CLI, to ignore any vision issues and test this with all models. Its outputs look like this:

Coordinates: x (horizontal) and y (vertical) start at 0

Current board:

0 0 0 1 - - 1 0 0

0 0 0 1 - - 2 0 0

0 0 0 1 3 - 3 1 0

0 0 0 0 1 2 - 1 0

1 1 1 0 0 1 1 1 0

- - 2 0 0 0 0 0 0

- - 3 1 1 0 0 0 0

- - - - 2 1 1 1 1

- - - - - - - - -

Enter action (r x y to reveal, f x y to flag):

Same random seed each time, so difficulty stays the same. Made it so their usual first move, r 4 4, reveals a large area, so they can demonstrate actual reasoning, not guesswork. This board is trivial for anyone who knows how to play.

I basically tried this with all existing models a few days ago, including the big o1, the big r1, Sonnet 3.5, etc. None of them could solve it.

o3 can do it. Just the plain o3-mini, I didn't try the o3-mini-high yet.

It might not seem like much, but it shows a clear ability for spatial reasoning, that is well beyond any others. We kind of already new this from the ARC stuff, I guess, but I'm impressed to see it in action.

Python script, for anyone curious:

7
 
 
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/singularity by /u/Independent_Pitch598 on 2025-01-31 20:17:35+00:00.

8
 
 
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/singularity by /u/MetaKnowing on 2025-01-31 20:10:11+00:00.

9
 
 
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/singularity by /u/imDaGoatnocap on 2025-01-31 20:08:46+00:00.

10
 
 
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/singularity by /u/vsauerr on 2025-01-31 19:39:59+00:00.

11
 
 
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/singularity by /u/jaundiced_baboon on 2025-01-31 19:22:02+00:00.

12
 
 
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/singularity by /u/d1ez3 on 2025-01-31 19:16:38+00:00.

13
 
 
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/singularity by /u/jaundiced_baboon on 2025-01-31 19:15:38+00:00.

14
 
 
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/singularity by /u/pigeon57434 on 2025-01-31 19:12:52+00:00.


15
 
 
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/singularity by /u/kevinmise on 2025-01-31 19:12:20+00:00.

16
 
 
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/singularity by /u/ShreckAndDonkey123 on 2025-01-31 19:11:41+00:00.

17
 
 
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/singularity by /u/man-o-action on 2025-01-31 19:05:43+00:00.


Edit : I am testing a 1500 line javascript code which o1 pro failed to debug despite 50+ attempts. Will report back.

Edit 2: We are cooked. o3-mini-high solved it at first try.

Edit 3 : HOLY SHIT! "Pro users will have unlimited access to both o3-mini and o3-mini-high."

(Source: )

18
 
 
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/singularity by /u/SnooPuppers3957 on 2025-01-31 18:22:22+00:00.

19
 
 
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/singularity by /u/ShreckAndDonkey123 on 2025-01-31 18:03:22+00:00.

20
 
 
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/singularity by /u/Different-Froyo9497 on 2025-01-31 17:58:24+00:00.

21
 
 
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/singularity by /u/Bena0071 on 2025-01-31 17:36:19+00:00.

22
 
 
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/singularity by /u/assymetry1 on 2025-01-31 16:24:44+00:00.

23
 
 
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/singularity by /u/Glittering-Neck-2505 on 2025-01-31 14:46:24+00:00.

24
 
 
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/singularity by /u/MetaKnowing on 2025-01-31 13:35:02+00:00.

25
 
 
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/singularity by /u/ShreckAndDonkey123 on 2025-01-31 12:23:01+00:00.

view more: next ›