Update GreaterThan_Detector_NN_ReadMe.md
Browse files
GreaterThan_Detector_NN_ReadMe.md
CHANGED
|
@@ -1,10 +1,12 @@
|
|
| 1 |
# GreaterThan_Detector_NN: A Challenge in Numerical Reasoning
|
| 2 |
|
| 3 |
-
|
|
|
|
| 4 |
|
| 5 |
GreaterThan_Detector_NN: A Challenge in Numerical Reasoning
|
| 6 |
|
| 7 |
-
This repository, part of the Neural_Nets_Doing_Simple_Tasks collection, explores a fundamental question: can a general-purpose neural network learn a task that is trivial for humans but requires symbolic reasoning?
|
|
|
|
| 8 |
|
| 9 |
The Objective
|
| 10 |
|
|
@@ -17,16 +19,10 @@ Prompt:
|
|
| 17 |
Generated code
|
| 18 |
10.00 , 09.21 Which is Greater ?
|
| 19 |
|
| 20 |
-
|
| 21 |
Expected Completion:
|
| 22 |
|
| 23 |
Generated code
|
| 24 |
10.00!
|
| 25 |
-
IGNORE_WHEN_COPYING_START
|
| 26 |
-
content_copy
|
| 27 |
-
download
|
| 28 |
-
Use code with caution.
|
| 29 |
-
IGNORE_WHEN_COPYING_END
|
| 30 |
|
| 31 |
This makes the full, correct sequence:
|
| 32 |
10.00 , 09.21 Which is Greater ? 10.00!
|
|
@@ -91,7 +87,8 @@ if __name__ == '__main__':
|
|
| 91 |
|
| 92 |
Baseline Model Performance
|
| 93 |
|
| 94 |
-
A baseline sequence model was trained on a dataset of 8,000 examples generated by the function above.
|
|
|
|
| 95 |
|
| 96 |
Prompt Model's Completion Result
|
| 97 |
10.00 , 09.21 Which is Greater ? 10.00! β
Correct
|
|
@@ -99,6 +96,7 @@ Prompt Model's Completion Result
|
|
| 99 |
54.12 , 54.13 Which is Greater ? 54.13! β
Correct
|
| 100 |
99.98 , 99.99 Which is Lesser ? 99.99! β Incorrect
|
| 101 |
00.01 , 10.00 Which is Lesser ? 00.00! β Incorrect
|
|
|
|
| 102 |
Analysis & The Challenge
|
| 103 |
|
| 104 |
The baseline model demonstrates a classic problem in machine learning: it has learned to be a good pattern matcher but has not acquired a robust, generalizable algorithm for numerical comparison.
|
|
@@ -127,16 +125,15 @@ Inside your directory, add your notebook (.ipynb) or scripts (.py) that define,
|
|
| 127 |
|
| 128 |
Please include a GreaterThan_Detector_NN_ReadMe.md in your directory explaining your approach, architecture, and results.
|
| 129 |
|
| 130 |
-
Submit a "community" comment to
|
| 131 |
|
| 132 |
-
Let's see who can build and train the most reliable numerical reasoner!
|
| 133 |
|
| 134 |
License
|
| 135 |
|
| 136 |
The def generate_synthetic_data() function is open-sourced under the MIT License. See the LICENSE file for more details.
|
| 137 |
|
| 138 |
|
| 139 |
-
|
| 140 |
Notes:
|
| 141 |
Generating synthetic data for tasks: ['4digit_Greater', '4digit_Lesser']...
|
| 142 |
Generated 8000 total sequences.
|
|
|
|
| 1 |
# GreaterThan_Detector_NN: A Challenge in Numerical Reasoning
|
| 2 |
|
| 3 |
+
Here is a GreaterThan_Detector_NN_ReadMe.md file designed for your Hugging Face repository.
|
| 4 |
+
It reports the results, presents the core challenge, and provides the dataset generator without disclosing the proprietary model details, inviting the community to tackle the problem.
|
| 5 |
|
| 6 |
GreaterThan_Detector_NN: A Challenge in Numerical Reasoning
|
| 7 |
|
| 8 |
+
This repository, part of the Neural_Nets_Doing_Simple_Tasks collection, explores a fundamental question: can a general-purpose neural network learn a task that is trivial for humans but requires symbolic reasoning?
|
| 9 |
+
The specific task is to compare two numbers presented in a natural language format and identify the greater or lesser one.
|
| 10 |
|
| 11 |
The Objective
|
| 12 |
|
|
|
|
| 19 |
Generated code
|
| 20 |
10.00 , 09.21 Which is Greater ?
|
| 21 |
|
|
|
|
| 22 |
Expected Completion:
|
| 23 |
|
| 24 |
Generated code
|
| 25 |
10.00!
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 26 |
|
| 27 |
This makes the full, correct sequence:
|
| 28 |
10.00 , 09.21 Which is Greater ? 10.00!
|
|
|
|
| 87 |
|
| 88 |
Baseline Model Performance
|
| 89 |
|
| 90 |
+
A baseline sequence model was trained on a dataset of 8,000 examples generated by the function above.
|
| 91 |
+
While it learned the general format and solved many cases correctly, its performance on tricky edge cases reveals a critical weakness.
|
| 92 |
|
| 93 |
Prompt Model's Completion Result
|
| 94 |
10.00 , 09.21 Which is Greater ? 10.00! β
Correct
|
|
|
|
| 96 |
54.12 , 54.13 Which is Greater ? 54.13! β
Correct
|
| 97 |
99.98 , 99.99 Which is Lesser ? 99.99! β Incorrect
|
| 98 |
00.01 , 10.00 Which is Lesser ? 00.00! β Incorrect
|
| 99 |
+
|
| 100 |
Analysis & The Challenge
|
| 101 |
|
| 102 |
The baseline model demonstrates a classic problem in machine learning: it has learned to be a good pattern matcher but has not acquired a robust, generalizable algorithm for numerical comparison.
|
|
|
|
| 125 |
|
| 126 |
Please include a GreaterThan_Detector_NN_ReadMe.md in your directory explaining your approach, architecture, and results.
|
| 127 |
|
| 128 |
+
Submit a "community" comment to link your solution to this repository.
|
| 129 |
|
| 130 |
+
Let's see who can build and train the most reliable numerical "Greater Than" reasoner!
|
| 131 |
|
| 132 |
License
|
| 133 |
|
| 134 |
The def generate_synthetic_data() function is open-sourced under the MIT License. See the LICENSE file for more details.
|
| 135 |
|
| 136 |
|
|
|
|
| 137 |
Notes:
|
| 138 |
Generating synthetic data for tasks: ['4digit_Greater', '4digit_Lesser']...
|
| 139 |
Generated 8000 total sequences.
|