• VNOJ
  • Trang chủ
  • Danh sách bài
  • Các bài nộp
  • Thành viên
  • Tổ chức
  • Các kỳ thi
  • Thông tin
    >
    • Máy chấm
    • Custom Checkers
    • Github
VI EN Đăng nhập  hoặc  Đăng ký

Blog - Trang 2

  • Thông tin
  • Thống kê
  • Blog

-4

mario party 9 good job

yoshi_fp36 đã đăng vào 13, Tháng 5, 2026, 2:45

also known as

  • mario party 9 the beginning of an adventure
  • mario party 10 nice job
  • mario party star rush sharacter museum
  • mario party the top 100 collection bgm
  • mario party superstars staff credits of mario party 9/10
  • super mario party jamboree free play results
yoshi_fp36
o13, Tháng 5, 2026, 2:45 9

-4

How to tune llama.cpp for free

yoshi_fp36 đã đăng vào 12, Tháng 5, 2026, 8:37

Here's a 100% legit tutorial to tune for llama.cpp MoE models. Very simple.

To start, you need to determine your target context size and other parameters (e.g. context size 64k, ctk q8_0, etc.). Enable flash attention.

We'll focus on the two main parameters: -ncm and -ub. Set the "batch size" -b to 4096 and left there.

Now, here's simple initial tuning:

If targetting maximum tg128, etc:

  • Decrease ncm until OOM, then increase ub in steps of 256 until reached 4096 or OOM. this is most effective when ncm = 0, but not really when ncm is high (see below).

Now, there's the option to do more comprehensive tuning. You can use this table as your template and change if neccessary.

here MAX can refer to your context size or [your context size] - 4096. Choose any but the former will leave a slight VRAM budget for other things.

ub ncm min pp4096 @ d8192 tg128 @ d8192 pp4096 @ dMAX tg128 @ dMAX
512 ... ... ... ... ...
1024
...
4096 (or OOM)

Basically you can do the d8192 tests first and see what fits you best (maximum tg or PP.) If you have low VRAM, chances that just setting ub to max won't change the tg much, but pp performance can literally double. After that you can test on depths with step size 4k from 0 to MAX.

yoshi_fp36
o12, Tháng 5, 2026, 8:37 1

-3

Random table

yoshi_fp36 đã đăng vào 12, Tháng 5, 2026, 8:21
model size params backend ngl ncpumoe cpu_strict n_batch n_ubatch fa mmap dio test t/s
qwen35moe 35B.A3B IQ4_NL - 4.5 bpw 16.79 GiB 34.66 B CUDA 99 38 1 4096 1536 1 0 1 pp4096 518.25 ± 0.00
qwen35moe 35B.A3B IQ4_NL - 4.5 bpw 16.79 GiB 34.66 B CUDA 99 38 1 4096 1536 1 0 1 tg128 19.79 ± 0.00
qwen35moe 35B.A3B IQ4_NL - 4.5 bpw 16.79 GiB 34.66 B CUDA 99 38 1 4096 1536 1 0 1 pp4096 @ d5120 453.65 ± 0.00
qwen35moe 35B.A3B IQ4_NL - 4.5 bpw 16.79 GiB 34.66 B CUDA 99 38 1 4096 1536 1 0 1 tg128 @ d5120 19.83 ± 0.00
qwen35moe 35B.A3B IQ4_NL - 4.5 bpw 16.79 GiB 34.66 B CUDA 99 38 1 4096 1536 1 0 1 pp4096 @ d10240 405.45 ± 0.00
qwen35moe 35B.A3B IQ4_NL - 4.5 bpw 16.79 GiB 34.66 B CUDA 99 38 1 4096 1536 1 0 1 tg128 @ d10240 19.43 ± 0.00
qwen35moe 35B.A3B IQ4_NL - 4.5 bpw 16.79 GiB 34.66 B CUDA 99 38 1 4096 1536 1 0 1 pp4096 @ d15360 367.85 ± 0.00
qwen35moe 35B.A3B IQ4_NL - 4.5 bpw 16.79 GiB 34.66 B CUDA 99 38 1 4096 1536 1 0 1 tg128 @ d15360 19.07 ± 0.00
qwen35moe 35B.A3B IQ4_NL - 4.5 bpw 16.79 GiB 34.66 B CUDA 99 38 1 4096 1536 1 0 1 pp4096 @ d20480 334.75 ± 0.00
qwen35moe 35B.A3B IQ4_NL - 4.5 bpw 16.79 GiB 34.66 B CUDA 99 38 1 4096 1536 1 0 1 tg128 @ d20480 18.30 ± 0.00
qwen35moe 35B.A3B IQ4_NL - 4.5 bpw 16.79 GiB 34.66 B CUDA 99 38 1 4096 1536 1 0 1 pp4096 @ d25600 311.31 ± 0.00
qwen35moe 35B.A3B IQ4_NL - 4.5 bpw 16.79 GiB 34.66 B CUDA 99 38 1 4096 1536 1 0 1 tg128 @ d25600 18.64 ± 0.00
qwen35moe 35B.A3B IQ4_NL - 4.5 bpw 16.79 GiB 34.66 B CUDA 99 38 1 4096 1536 1 0 1 pp4096 @ d30720 287.13 ± 0.00
qwen35moe 35B.A3B IQ4_NL - 4.5 bpw 16.79 GiB 34.66 B CUDA 99 38 1 4096 1536 1 0 1 tg128 @ d30720 17.87 ± 0.00
qwen35moe 35B.A3B IQ4_NL - 4.5 bpw 16.79 GiB 34.66 B CUDA 99 38 1 4096 1536 1 0 1 pp4096 @ d35840 267.69 ± 0.00
qwen35moe 35B.A3B IQ4_NL - 4.5 bpw 16.79 GiB 34.66 B CUDA 99 38 1 4096 1536 1 0 1 tg128 @ d35840 18.42 ± 0.00
qwen35moe 35B.A3B IQ4_NL - 4.5 bpw 16.79 GiB 34.66 B CUDA 99 38 1 4096 1536 1 0 1 pp4096 @ d40960 250.68 ± 0.00
qwen35moe 35B.A3B IQ4_NL - 4.5 bpw 16.79 GiB 34.66 B CUDA 99 38 1 4096 1536 1 0 1 tg128 @ d40960 17.50 ± 0.00
qwen35moe 35B.A3B IQ4_NL - 4.5 bpw 16.79 GiB 34.66 B CUDA 99 38 1 4096 1536 1 0 1 pp4096 @ d46080 237.45 ± 0.00
qwen35moe 35B.A3B IQ4_NL - 4.5 bpw 16.79 GiB 34.66 B CUDA 99 38 1 4096 1536 1 0 1 tg128 @ d46080 17.19 ± 0.00
yoshi_fp36
o12, Tháng 5, 2026, 8:21 0

-4

Memorandum to Performance Analysts

yoshi_fp36 đã đăng vào 8, Tháng 5, 2026, 15:03

SPEC

   subject: SPEC Run Rules for CINT92     date: January 23, 1992

                                          from: SPEC Steering Committee



                             ABSTRACT



   This document provides guidelines for how to run the
   benchmarks, how to report the results (which is explained
   more fully in the document SPEC Reporting Rules) and how to
   report results for modified versions of benchmarks.  It
   contains several examples of necessary types of changes.




   Memorandum to
   Performance Analysts









































                                          SPEC



   subject: SPEC Run Rules for CINT92     date: January 23, 1992

                                          from: SPEC Steering Committee



   1.  General

   The general philosophy behind this set of rules for running
   the SPEC benchmarks is to ensure repeatability and
   documentation of all factors pertinent to duplicating
   reported performance results. Suggestions for improving this
   run methodology should be made to SPEC for consideration in
   future releases.

   In general:

      - All times reported must be for runs that produce
        correct results.

      - Single user mode is allowed.  This must be documented
        in a report of the results.

      - These benchmarks have been run across a number of
        platforms with several compilers, but are not
        guaranteed to be free of language extensions that are
        commonly found, but not part of industry standard
        language specifications.

      - Use of software features (in pre-processors, compilers,
        etc.) which invoke, generate or use software designed
        specifically for any of the SPEC Benchmark Releases is
        not allowed.

      - Source code may not be modified unless it is required
        for portability. See Section 4 (Source Code Changes)
        for specifics.



   2.  Makefiles,_Compilers_and_Libraries

   Generic Makefiles are provided for convenience and insight
   into running the benchmarks. Typically a vendor supplied
   wrapper (M.ident) is used in conjunction with the generic
   Makefile. These are provided for convenience and can be
   replaced to facilitate running benchmarks on proprietary
   operating systems or to take advantage of optimizations not
   easily activated with the current method.  It is anticipated
   that the build process which is capable of building the
   highest performance executable will be employed, even if a











                              - 2 -



   different set of optimizations are activated for each
   benchmark.

   For example, one SPEC benchmark can be built with one set of
   options and another can be built with an entirely different
   set of options.

   Special libraries may be used as long as they

     1.  are products announced for availability to customers
         within 6 months of the date of test,

     2.  do not require changing benchmark source code,

     3.  implicitly replace routines in the benchmark source
         code,

     4.  are not "benchmark specific" (i.e.targeted
         specifically for a SPEC benchmark).

   Libraries can be specified in the Makefile. Source code
   changes to eliminate default high level language subroutines
   so that optimized library versions can be used is not
   allowed.

   Accuracy verification specifications (i.e. to diff or spiff)
   must not be overridden in Makefiles (or in any other
   manner).  Only SPEC approved alternative input files are
   allowed. Use of these alternate input files must be
   documented in the footnotes section.

   It should be recognized that each company is continuously
   improving its compilation system and that new optimizations
   may be required to achieve published performance ratings.
   If you are unable to reproduce a vendor's published rating,
   please contact the vendor and ensure that you have the
   latest Makefile wrappers and equivalent system
   configuration.



   3.  Pre-Processors

      - Use of a pre-processor on unmodified source code that
        is part of the identified released software is allowed.

      - Use of benchmark specific pre-processor features is not
        allowed.

      - Documented pre-processor options may be used, and
        should be specified either in the testing report or in











                              - 3 -



        the vendor Makefile (M.ident) provided on the SPEC
        tape.


   4.  Source_Code_Changes

   SPEC policy allows only portability changes to the source
   code.  When source code changes are made, a diff listing of
   the changes must be included in the testing report.

   SPEC encourages performance comparisons to be made across
   systems running the unmodified SPEC code, or if necessary,
   with the minimum required portability changes.


      - Portability

           o Source code changes required for standards
             compliance should be described in the testing
             report.  Appropriate standards documents should be
             cited.  All such changes should be reported to
             SPEC.  SPEC may consider incorporating such
             changes in future releases.  Whenever possible,
             SPEC will strive to develop and enhance benchmarks
             to be POSIX and ANSI compliant.

             SPEC recognizes that some source code changes may
             be necessary in order to meet specific language
             implementations.  Such changes must be documented,
             appropriate references must be cited and results
             marked as portability changes (denoted by * on any
             publicly reported results).  Once these changes
             are incorporated into the tape including alternate
             source files, then the asterisk may be dropped.

           o The double precision (real*8) floating point in
             SPEC benchmarks is intended to be 64 bits.  If
             this is not the case on the machine being tested,
             modify the appropriate declarations and forward to
             SPEC the source code changes required to
             accommodate the system.  Over time, SPEC desires
             to use compiler options to eliminate the need for
             such changes.

           o If any portability change is backed off, one of
             three things must happen:

               i.  benchmark code does not compile, or

              ii.  benchmark code does not execute, or












                              - 4 -



             iii.  benchmark code produces the wrong result.


   5.  Alternate_Source_Files

   Some portability changes made to the source files may be
   generally useful to other SPEC users.  These are classes of
   codes that meet the portability requirement (i.e., code
   won't compile, or won't execute, or produces wrong result),
   but are required by one or more target systems.  In such
   instances, the files are placed in a directory named
   "src.alt" in the benchmark directory.  All files placed in
   this directory have been reviewed and approved by the SPEC
   Steering Committee; alternate source files are otherwise not
   permitted.  When alternate source files have been approved
   for inclusion, however, the M.vendor makefile must be
   modified to retrieve the appropriate version.  In general,
   original copies of codes which are to be replaced by
   alternate copies are preserved in the directory "src.std"
   for safe-keeping.  When an alternate source file is used, it
   must be annotated in the footnotes section of the report.
   The asterisk (*) is not used to denote portability changes
   when alternate source files are employed.


   6.  Alternate_Results_Files

   On occasion, one platform will cause a variance in the
   results of the benchmark which may not necessarily be
   incorrect.  In such cases, an alternate results directory,
   called "result.alt", contains additional valid results.
   Only results that have been reviewed and approved by the
   SPEC Steering Committee are placed in this directory.
   Otherwise, all results must compare with the files in the
   "result.ref" directory.  In such cases, the M.vendor
   makefile must be modified to accommodate alternate results.
   An example of such a case can be seen with the benchmark
   085.gcc.  In this program, the output is sorted by a system
   library routine. As more vendors ran 085.gcc, it became
   apparent that the sort routine produced different orderings
   among systems.  Though the results were still equivalent,
   they could not be compared directly with the results.ref
   files, hence, the addition of alternate results became
   necessary.  If alternate results are used, they must be
   specified in the footnotes section of the report.

















                                          SPEC



   subject: SPEC Reporting Rules For      date: January 24, 1992
            CINT92
                                          from: SPEC Steering Committee



                             ABSTRACT



   This memorandum describes the Rules for reporting the
   results of SPEC CINT92 suite.  This is a companion document
   to SPEC Run Rules, which describes the process of running
   the benchmarks contained in the SPEC Suite.




   Memorandum to
   Performance Analysts










































                                          SPEC



   subject: SPEC Reporting Rules For      date: January 24, 1992
            CINT92
                                          from: SPEC Steering Committee



   1.  SPEC_CINT92_Introduction

   The SPEC CINT92 benchmark suite consists of CPU intensive
   benchmarks that are intended to be meaningful samples of
   applications which perform non-floating point logic and
   computations in a technical computing environment. SPEC
   CINT92 does not assess the ability of a system under test to
   handle disk, graphics, communication, or multi-user
   activity.

   Many of the SPEC CINT92 benchmarks have been derived from
   publicly-available application programs, and they are
   intended to be portable to as many current and future
   hardware platforms as possible.  Consequently, the version
   of the programs in this distribution contains no hardware
   dependencies.  Where possible, the programs have been
   structured to ensure that they do not unfairly favor one
   hardware architecture over another.  For this reason, the
   application programs in this distribution should not be used
   to assess the probable performance of commercially
   available, tuned versions of the same application.

   Different degrees of tuning and availability of specialized
   hardware features may make large (and varying) differences
   between the performance of the programs in this distribution
   and that of commercial applications for production use.

   The individual benchmarks in this suite may be similar, but
   not necessarily identical to benchmarks with the same name
   which are available from sources other than SPEC.  Therefore
   it may not be valid to compare the individual benchmark
   results of SPEC CINT92 to any benchmarks not obtained from
   SPEC.

   Reporting of results must follow the guidelines established
   in the "Reporting Guidelines" section of the distribution
   documentation. It is not intended that the SPEC CINT92
   benchmark suite be used as a replacement for the
   benchmarking of actual customer applications to determine
   vendor or product selection.
















                              - 2 -



   2.  REPORTING_GUIDELINES

   This section describes the standard reporting format that
   must be used when reporting results using the SPEC CINT92
   benchmarks to obtain a SPECint92 rating for a system. The
   SPECint92 rating is used to describe the speed of a system.

   EXAMPLES OF REPORTING RULES FOR SPEC RESULTS CAN BE SEEN IN
   THE QUARTERLY SPEC NEWSLETTER. 1 The intent of the SPECint92
   Reporting Guidelines is to describe a consistent method of
   reporting results to aid customers in interpreting and
   comparing results of different systems' performance.

   All measured and calculated values are reported to the
   nearest tenth.

   Unless noted to the contrary, the report must contain the
   following items of information:

   2.1  Table_of_Run_Times

   A table of 7 rows and 4 columns structured as follows:

     a.  One row for each individual benchmark in SPEC CINT92

     b.  Four columns


           i.  Name of benchmark (preferred order (numerical)
               shown in example below)

          ii.  SPEC Reference Time for each benchmark in the
               suite.

                 A.  008.espresso: 2270 seconds

                 B.  022.li: 6210 seconds

                 C.  023.eqntott: 1100 seconds

                 D.  026.compress: 2770 seconds

                 E.  072.sc: 4530 seconds



   __________

    1. Articles describing the suite and metrics can be found
       in the December 1991 SPEC Newsletter.












                              - 3 -



                 F.  085.gcc: 5460 seconds



         iii.  For SPECint92, Elapsed time (rounded to the
               nearest 0.1 second) for the System-Under-Test
               (SUT) taken to run each benchmark of SPEC CINT92
               suite.


               The elapsed times reported are for each
               benchmark run one at a time.  For the SPECint92,
               the elapsed time value measures the execution of
               a single copy of each benchmark.  The reported
               times are the best times that can be
               consistently reproduced.

               If any changes to an individual benchmark are
               made for portability purposes (e.g. to obtain a
               clean compile), the elapsed time should be
               followed by an "*", and a description of the
               change should be described in PART 7 of the
               report.  The asterisk is dropped after the
               portability change has been approved by the SPEC
               steering committee as a portability change and
               not a performance change and when this change
               has been implemented in the tape.  2

          iv.  For SPECint92, Column 4 has Ratio of SPEC
               Reference Time to SUT time (divide number in
               column 2 by number in column 3).  The ratio is
               without dimension, and will be referred to as
               SPECratio.


     c.  A row with the Geometric mean for the numbers listed
         in column 4.  This row is labeled as "Geom Mean:
         SPECint92".  The Geometric Mean is calculated as the
         6th root of the product of the individual values in
         the SPECratio column.

         For SPECint92, the Geometric Mean of the SPECratios in
         column 4 is defined as the SPECint92 of the SUT.

         Please note:  if an "*" sign appears in any of the


   __________

    2. REFER to portability rules in SPEC run rules document.












                              - 4 -



         rows to indicate a portability change in a benchmark,
         the "*" must be displayed after the Geometric Mean
         calculation for elapsed time and SPECratio.

   2.2  Identification_Information

   This information includes the date (month/year) that the
   benchmarks were run, the name and location of the company
   that ran them and the license number of the SPEC tape used.
   It is typically listed under the table generated in Section
   2.1.


   2.3  Hardware_Configuration

   A description of  the system (cpu/clock, fpu/clock) number
   of processors, relevant peripherals, etc are included in
   this space.  It is typically listed next to the table
   generated in Section 2.1.  The amount of information
   supplied should be sufficient to allow duplication of
   results by another party.  The following checklist is
   provided to show examples of hardware features which may
   affect the performance of SPEC CINT92 benchmarks.  The
   checklist is not intended to be all-inclusive, nor is each
   feature in the list required to be described.  The rule of
   thumb is: "if it affects performance or the feature is
   required to duplicate the results, describe it".

   HARDWARE PLATFORM FEATURES

      o Manufacturer's model number

      o CPU component ID and clock speed

      o Floating Point Unit (FPU) ID and clock speed

      o Number of CPUs and FPUs

      o Cache Size (per CPU), description and organization

      o Memory (Amount and description)

      o Disk subsystem configuration

           - Number of active connected disks

           - Disk controllers: ID, number and type

           - Disk: ID, number and type













                              - 5 -



      o Network Interface


   2.4  Software_Configuration


   The description of the software configuration used should be
   detailed enough so that another party could duplicate the
   results reported. This is typically reported after the
   hardware decription.  The following is a list of possible
   items that might be documented:

      o Operating System Type and Release level (with
        revision/update level if appropriate)

      o Compiler release levels used.  If multiple compilers
        are available, specify which ones were used.

      o Optional software required to reproduce results

      o File system type used

      o Firmware level


   2.5  Systems_Environment


   This section should document the overall systems environment
   used to run the benchmark. It is typically listed after the
   software description. The following is a list of possible
   items that might be documented:  SYSTEMS ENVIRONMENT
   CHECKLIST

      o Single or multi-user state

      o System tuning parameters (e.g. MAKEFS and TUNEFS
        parameters)

      o Process tuning parameters (e.g. stack size, time slice)

      o Background load, if any


   2.6  General_Availability_of_the_SUT


   This item indicates the availability (or projected
   availability) date for the system or components being
   benchmarked.  If both hardware and software are not
   available at the same time, list them separately. SPEC











                              - 6 -



   requires that the availability of hardware and software
   should be within 6 months of the date of publication of the
   data. It should be listed in Month/Year format.


   2.7  Individual_Benchmark_Notes


   This section documents ANY changes made to the individual
   benchmark source code or execution parameters for
   portability reasons.  For example, if difficulty arises in
   compiling or executing an individual benchmark because of
   portability problems, the difficulty should be described,
   and the code changes made to resolve the portability problem
   documented.  This should include module name, line number
   and actual source code changes.  This section must also
   include any use of alternate source files, alternate input
   files, or alternate result files, as noted in "Runrules"
   section of the distribution documentation.


   2.8  Graphical_Representation

   If desired, a graphical summary of the table from Section
   2.1 may be included as an additional item of information.

   If included for SPECint92, it should be shown as follows:

     a.  The 6 benchmarks comprising of  SPEC CINT92  suite
         should appear as individual points on the x-axis
         ordered by benchmark numbers.

     b.  The SPECratio of each benchmark (obtained from column
         4 in the table described above), should be plotted as
         a vertical bar graph.

     c.  A horizontal line should be drawn across the graph
         intersecting the y-axis at the SPECint92 geometric
         mean value.  This permits differentiation between each
         benchmark's SPECratio and the overall SPECint92
         geometric mean.



   3.  SAMPLE_REPORTS

















                              - 7 -



   3.1  SPECint92_-_Summary_of_SPEC_CINT92_Results

    __________________________________________________________
   | SPEC Benchmark|      SPEC    |  System Under|  SPECratio|
   |     CINT92    |   Reference  |      Test    |           |
   |               |  Time Elapsed|    (Sample)  |           |
   |               |      Time    |  Elapsed Time|           |
   |               |   (seconds)  |   (seconds)  |           |
   |_______________|______________|______________|___________|
   |_______________|______________|______________|___________|
   | 008.espresso  |      2270    |     1245.0   |    1.8    |
   |_______________|______________|______________|___________|
   | 022.li        |      6210    |     2275.0   |    2.7    |
   |_______________|______________|______________|___________|
   | 023.eqntott   |      1100    |      684.0   |    1.6    |
   |_______________|______________|______________|___________|
   | 026.compress  |      2770    |     1390.0   |    2.0    |
   |_______________|______________|______________|___________|
   | 072.sc        |      4530    |      783.0   |    5.8    |
   |_______________|______________|______________|___________|
   | 085.gcc       |      5460    |     1521.0   |    3.6*   |
   |_______________|______________|______________|___________|
   | Geometric Mean|              |              |    2.6*   |
   |_______________|______________|______________|___________|


   *Changes made to benchmark for portability reasons, see PART
   7 for details

   The geometric mean is the nth root of the product of n
   elements. Therefore, in this case it is the 6th root of the
   product of 6 SPECratios in the 4th column.


   3.2  Identification_Information

   Tested on mm/yy, tested by xxxxx of (location), SPEC License
   No. yyyyy.

   3.3  Hardware_Configuration

   Hardware Configuration Detail for "Sample" system

      o Vendor machine xxxxxxx, model number nnnnnnn

      o RISC processor xxxxxx; nn MHz clock speed

      o FPU xxxxxxxx, nn MHz clock speed

      o One processor/FPU












                              - 8 -



      o n MB main memory

      o n disk drives, each nnn megabytes capacity


   3.4  Software_Configuration

   Software Configuration Detail for "Sample" system

      o Operating System name, Version nnn, update nnn applied

      o FORTRAN Version nnn

      o C Version nnn

      o Other software:  (if applicable)

      o Firmware level:  (if applicable)



   3.5  General_Availability

   The harware wil be available mm/yy and the software will be
   available in nn/yy.

   Note: When a beta version is used, the date of the final
   release must be included which must be within 6 months of
   result publication.

   3.6  Systems_Environment

   Systems Environment Detail for "Sample" system

      o Tuning Parameters:

      o Background load:

      o System state:  (Mult-user or single user)


   3.7  Notes_On_Individual_Benchmark_Runs

      - 085.gcc: was compiled using gnu compiler * 085.gcc:
        changed variable name xxxx to yyyy to avoid library
        conflict.
yoshi_fp36
o8, Tháng 5, 2026, 15:03 1

-2

How to benchmaxxing tutorial 2026 full

yoshi_fp36 đã đăng vào 6, Tháng 5, 2026, 3:46
  1. If-testing. If the benchmark asks you to render something, pre-render it and just play the video! Free FPS!!!
  2. UNLEASH THE POWER LIMITS. THE USER WOULDN'T NOTICE ANYWAY. WHO CARES IF THE PHONE BURN THEIR HANDS FOR A 30% BETTER SCORE? (For maximum effect, make this a mode that the user has no way to access.)
  3. Use benchmark specific optimization, eg. hand tuned code, carefully selected options, and hidden flags. Using this in base rules or online platforms is recommended.
  4. Modify the code at run-time through an optimizer (eg. Intel iBOT) for maximized performance while still running the same thing. (Do not tell anyone it's not the same code...)
  5. Train on the test set and tune based on the test set. Why would anyone notice that your model is suddenly a LGM on your own "Codeforces" benchmark? (Don't tell anyone it's outputting the sample codes...)
yoshi_fp36
o6, Tháng 5, 2026, 3:46 1

-2

i think y0shi is better than yoshi

yoshi_fp36 đã đăng vào 23, Tháng 4, 2026, 3:39

it's "unemployed" and "jobless" story, i guess. and i hate y0shi. don't tell anyone this...

yoshi_fp36
o23, Tháng 4, 2026, 3:39 1

0

Fastest algorithms

yoshi_fp36 đã đăng vào 21, Tháng 4, 2026, 12:08

Sometimes, algorithmic ingenuity is only one half of the problem. Making a slow algorithm fast might be hard, but the king of algorithms is... multiple algorithms, each one being optimized and handling their respective purpose. I guess.

I tried to make "I'm an PhD in applied CS" joke, but that would be a blatant lie.

yoshi_fp36
o21, Tháng 4, 2026, 12:08 2

-1

độ dài trung bình của đoạn sinh ngẫu nhiên là n/3

yoshi_fp36 đã đăng vào 19, Tháng 4, 2026, 5:01

đừng nghĩ rằng nó là n/2, thực ra nó là n/3 nên bạn nên nghĩ rằng code bạn sẽ chạy chậm gấp 3 lần ở max test 1d (hoặc 9 lần với 2d) nếu làm trâu...

yoshi_fp36
o19, Tháng 4, 2026, 5:01 0

-6

carpet làm trâu ac

yoshi_fp36 đã đăng vào 18, Tháng 4, 2026, 12:33

mặc dù nmk là 1.8e9 nhưng không sao, nếu bạn hít nhiều chất tên là AVX2, bạn có thể khiến cho máy chấm ép bạn AC được... mik không chỉ cách đâu hihihihihi

yoshi_fp36
o18, Tháng 4, 2026, 12:33 3

0

thằng nào downvote

yoshi_fp36 đã đăng vào 16, Tháng 4, 2026, 13:56

downvote tiếp đi!!! downvote đi!!! không ai quan tâm đâu...

init():

    ret

Yoshi():

    push    rax

    push2p  rbp, r15

    push2p  r14, r13

    push2p  r12, rbx

    sub     rsp, 16

    mov     rdi, qword ptr [rip + std::cin@GOTPCREL]

    lea     rsi, [rsp + 4]

    call    std::istream::operator>>(int&)@PLT

    movsxd  r12, dword ptr [rsp + 4]

    test    r12, r12

    js      .LBB1_37

    test    r12d, r12d

    je      .LBB1_15

    {nf}    shl     r15, r12, 2

    mov     rdi, r15

    call    operator new(unsigned long)@PLT

    mov     rbx, rax

    mov     dword ptr [rax], 0

    lea     r14, [rax + 4]

    cmp     r12d, 1

    je      .LBB1_4

    lea     r13, [rbx + r15]

    lea     rdx, [r15 - 4]

    mov     rdi, r14

    xor     esi, esi

    call    memset@PLT

    mov     r14, r13

.LBB1_4:

    lea     rbp, [rbx + 4*r12]

    mov     r15, qword ptr [rip + std::cin@GOTPCREL]

    mov     r12, rbx

    mov     qword ptr [rsp + 8], rbp

.LBB1_5:

    mov     rdi, r15

    mov     rsi, r12

    call    std::istream::operator>>(int&)@PLT

    add     r12, 4

    cmp     r12, r14

    jne     .LBB1_5

    movsxd  r13, dword ptr [rsp + 4]

    test    r13, r13

    js      .LBB1_38

    test    r13d, r13d

    je      .LBB1_16

    {nf}    shl     r12, r13, 2

    mov     rdi, r12

    call    operator new(unsigned long)@PLT

    mov     r14, rax

    mov     dword ptr [rax], 0

    lea     r15, [rax + 4]

    cmp     r13d, 1

    je      .LBB1_12

    lea     rbp, [r14 + r12]

    lea     rdx, [r12 - 4]

    mov     rdi, r15

    xor     esi, esi

    call    memset@PLT

    mov     r15, rbp

.LBB1_12:

    lea     rbp, [r14 + 4*r13]

    mov     r12, qword ptr [rip + std::cin@GOTPCREL]

    mov     r13, r14

.LBB1_13:

    mov     rdi, r12

    mov     rsi, r13

    call    std::istream::operator>>(int&)@PLT

    add     r13, 4

    cmp     r13, r15

    jne     .LBB1_13

    jmp     .LBB1_17

.LBB1_15:

    xor     ebx, ebx

    mov     qword ptr [rsp + 8], 0

.LBB1_16:

    xor     ebp, ebp

    xor     r14d, r14d

.LBB1_17:

    mov     ecx, 1

    xor     esi, esi

    xor     edi, edi

    jmp     .LBB1_19

.LBB1_18:

    inc     rdi

    inc     rcx

    cmp     rdi, 3000

    je      .LBB1_28

.LBB1_19:

    cmp     rdi, 2998

    ja      .LBB1_18

    mov     r8d, dword ptr [rbx + 4*rdi]

    mov     r9d, dword ptr [r14 + 4*rdi]

    mov     r10, rcx

    jmp     .LBB1_23

.LBB1_21:

    mov     eax, r16d

.LBB1_22:

    {nf}    shl     edx, eax, 13

    xor     eax, edx

    {nf}    shr     edx, eax, 17

    xor     eax, edx

    {nf}    shl     edx, eax, 5

    xor     eax, edx

    add     rsi, rax

    inc     r10

    cmp     r10, 3000

    je      .LBB1_18

.LBB1_23:

    mov     eax, dword ptr [rbx + 4*r10]

    {nf}    sub     edx, r8d, eax

    sub     eax, r8d

    cmovl   eax, edx

    mov     edx, dword ptr [r14 + 4*r10]

    {nf}    sub     r11d, r9d, edx

    sub     edx, r9d

    cmovl   edx, r11d

    cmp     eax, edx

    cmovb   r16d, edx, eax

    cmovbe  eax, edx

    test    r16d, r16d

    je      .LBB1_22

    xor     edx, edx

    div     r16d

    test    edx, edx

    je      .LBB1_21

    tzcnt   eax, edx

    tzcnt   r11d, r16d

    shrx    edx, edx, eax

    shrx    r16d, r16d, r11d

.LBB1_26:

    {nf}    sub     r17d, r16d, edx

    sub     r18d, edx, r16d

    cmovae  edx, r16d

    cmovae  r17d, r18d

    tzcnt   r16d, r18d

    shrx    r16d, r17d, r16d

    test    r16d, r16d

    jne     .LBB1_26

    cmp     r11d, eax

    cmovb   eax, r11d

    shlx    eax, edx, eax

    jmp     .LBB1_22

.LBB1_28:

    mov     rdi, qword ptr [rip + std::cout@GOTPCREL]

    call    std::ostream& std::ostream::_M_insert<long>(long)@PLT

    mov     byte ptr [rsp + 3], 10

    mov     rcx, qword ptr [rax]

    mov     rcx, qword ptr [rcx - 24]

    cmp     qword ptr [rax + rcx + 16], 0

    je      .LBB1_31

    lea     rsi, [rsp + 3]

    mov     edx, 1

    mov     rdi, rax

    call    std::basic_ostream<char, std::char_traits<char>>& std::__ostream_insert<char, std::char_traits<char>>(std::basic_ostream<char, std::char_traits<char>>&, char const*, long)@PLT

    jmp     .LBB1_32

.LBB1_31:

    mov     rdi, rax

    mov     esi, 10

    call    std::ostream::put(char)@PLT

.LBB1_32:

    test    r14, r14

    je      .LBB1_34

    {nf}    sub     rsi, rbp, r14

    mov     rdi, r14

    call    operator delete(void*, unsigned long)@PLT

.LBB1_34:

    test    rbx, rbx

    je      .LBB1_36

    {nf}    sub     rsi, qword ptr [rsp + 8], rbx

    mov     rdi, rbx

    call    operator delete(void*, unsigned long)@PLT

.LBB1_36:

    add     rsp, 16

    pop2p   rbx, r12

    pop2p   r13, r14

    pop2p   r15, rbp

    pop     rax

    ret

.LBB1_37:

    lea     rdi, [rip + .L.str.4]

    call    std::__throw_length_error(char const*)@PLT

.LBB1_38:

    lea     rdi, [rip + .L.str.4]

    call    std::__throw_length_error(char const*)@PLT

    jmp     .LBB1_47

    mov     r15, rax

    test    r14, r14

    jne     .LBB1_45

    test    rbx, rbx

    jne     .LBB1_48

    jmp     .LBB1_43

    mov     r15, rax

.LBB1_45:

    {nf}    sub     rsi, rbp, r14

    mov     rdi, r14

    call    operator delete(void*, unsigned long)@PLT

    test    rbx, rbx

    jne     .LBB1_48

.LBB1_43:

    mov     rdi, r15

    call    _Unwind_Resume@PLT

.LBB1_47:

    mov     r15, rax

.LBB1_48:

    {nf}    sub     rsi, qword ptr [rsp + 8], rbx

    mov     rdi, rbx

    call    operator delete(void*, unsigned long)@PLT

    mov     rdi, r15

    call    _Unwind_Resume@PLT

main:

    push    rax

    xor     edi, edi

    call    std::ios_base::sync_with_stdio(bool)@PLT

    mov     rax, qword ptr [rip + std::cin@GOTPCREL]

    mov     rcx, qword ptr [rax]

    mov     rcx, qword ptr [rcx - 24]

    mov     qword ptr [rax + rcx + 216], 0

    lea     rdi, [rip + .L.str]

    lea     rsi, [rip + .L.str.1]

    call    fopen@PLT

    test    rax, rax

    je      .LBB2_2

    mov     rax, qword ptr [rip + stdin@GOTPCREL]

    mov     rdx, qword ptr [rax]

    lea     rdi, [rip + .L.str]

    lea     rsi, [rip + .L.str.1]

    call    freopen@PLT

    mov     rax, qword ptr [rip + stdout@GOTPCREL]

    mov     rdx, qword ptr [rax]

    lea     rdi, [rip + .L.str.2]

    lea     rsi, [rip + .L.str.3]

    call    freopen@PLT

.LBB2_2:

    call    Yoshi()

    xor     eax, eax

    pop     rcx

    ret

.L.str:

    .asciz  ".inp"

.L.str.1:

    .asciz  "r"

.L.str.2:

    .asciz  ".out"

.L.str.3:

    .asciz  "w"

.L.str.4:

    .asciz  "cannot create std::vector larger than max_size()"

DW.ref._gxxpersonality_v0:

    .quad   __gxx_personality_v0
yoshi_fp36
o16, Tháng 4, 2026, 13:56 15
  • «
  • 1
  • 2
  • 3
  • »

dựa trên nền tảng DMOJ | theo dõi VNOI trên Github và Facebook