P2048 - [USACO 2024 US Open Contest Silver] Problem 3. The 'Winning' Gene

File submission: no freopen required. Memory limit: 512 MB. Time limit: 2.000 s.

Judging method: standard judge.


After years of hosting games and watching Bessie get first place over and over, Farmer John has realized that this can't be accidental. Instead, he concludes that Bessie must have winning coded into her DNA so he sets out to find this "winning" gene.

He devises a process to identify possible candidates for this "winning" gene. He takes Bessie's genome, which is a string $S$ of length $N$. He picks some pair $(K, L)$ where $1 \le L \le K \le N$, representing that the "winning" gene candidates will have length $L$ and will be found within a larger substring of length $K$. To identify the gene, he takes all length-$K$ substrings of $S$, each of which we will call a $K$-mer. For a given $K$-mer, he takes all of its length-$L$ substrings, identifies the lexicographically minimal one as a winning gene candidate (choosing the leftmost such substring if there is a tie), and then adds the $0$-indexed position where that substring starts in $S$ to a set $P$.
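The selection process for a single pair can be sketched directly in Python. This is a minimal brute-force illustration, not an efficient solution; the function name `candidate_positions` and its parameters are assumptions made for this example.

```python
def candidate_positions(s: str, k: int, l: int) -> set[int]:
    """Return the set P of 0-indexed start positions for the pair (k, l)."""
    positions = set()
    for start in range(len(s) - k + 1):  # each K-mer is s[start:start + k]
        # Start positions (in s) of every length-l substring inside this K-mer.
        window = range(start, start + k - l + 1)
        # min() returns the first minimal item it meets, so ties go to the
        # leftmost substring, as the statement requires.
        positions.add(min(window, key=lambda j: s[j:j + l]))
    return positions


print(sorted(candidate_positions("AGTCAACG", 4, 2)))  # [0, 3, 4]
print(sorted(candidate_positions("AGTCAACG", 4, 1)))  # [0, 4]
```

Python compares strings lexicographically by code point, which matches the intended order for uppercase letters.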

Since he hasn't picked $K$ and $L$ yet, he wants to know how many candidates there will be for every pair $(K, L)$.

For each $v$ in $1 \dots N$, help him determine the number of $(K, L)$ pairs with $|P| = v$.
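The quantity being asked for can be computed by trying every pair and tallying the size of each resulting set. The sketch below does exactly that; it is a reference brute force that is only practical for short strings, and the name `count_by_set_size` is an assumption made for this example.

```python
def count_by_set_size(s: str) -> list[int]:
    """Return a list whose entry v-1 is the number of (K, L) pairs with |P| = v."""
    n = len(s)
    counts = [0] * (n + 1)  # counts[v] = number of pairs whose set P has size v
    for k in range(1, n + 1):
        for l in range(1, k + 1):
            chosen = set()
            for start in range(n - k + 1):  # every K-mer
                window = range(start, start + k - l + 1)
                # Leftmost lexicographically minimal length-l substring.
                chosen.add(min(window, key=lambda j: s[j:j + l]))
            counts[len(chosen)] += 1
    return counts[1:]


for value in count_by_set_size("AGTCAACG"):
    print(value)
```

For the string `AGTCAACG` this prints 11, 10, 5, 4, 2, 2, 1, 1 on separate lines; the eight values sum to 36, the number of valid pairs for a string of length 8.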

INPUT FORMAT:

The first line contains $N$, representing the length of the string. The second line contains $S$, representing the given string. All characters are guaranteed to be uppercase characters where $s_i \in A - Z$, since bovine genetics are far more advanced than ours.

OUTPUT FORMAT:

For each $v$ in $1 \dots N$, output the number of $(K, L)$ pairs with $|P| = v$, with each number on a separate line.

SAMPLE INPUT:

8
AGTCAACG

In this test case, the third line of the output is 5 because there are exactly 5 pairs $(K, L)$ that yield three "winning" gene candidates. These pairs and their sets $P$ (positions are $0$-indexed) are:

(4,2) -> P = [0,3,4]
(5,3) -> P = [0,3,4]
(6,4) -> P = [0,3,4]
(6,5) -> P = [0,1,3]
(6,6) -> P = [0,1,2]

To see how $(4,2)$ leads to these results, we take all $4$-mers:

AGTC
GTCA
TCAA
CAAC
AACG

For each $4$-mer, we identify the lexicographically minimal length-$2$ substring:

AGTC -> AG
GTCA -> CA
TCAA -> AA
CAAC -> AA
AACG -> AA

We take the positions of all these substrings in the original string and add them to a set $P$ to get $P = [0, 3, 4]$.

On the other hand, if we focus on the pair $(4,1)$, we see that this only leads to $2$ total "winning" gene candidates. If we take all $4$-mers and identify the lexicographically minimal length-$1$ substring (using A, A' and A* to distinguish the different As), we get:

AGTC -> A
GTCA' -> A'
TCA'A* -> A'
CA'A*C -> A'
A'A*CG -> A'

While both A' and A* are lexicographically minimal in the last 3 cases, the leftmost substring takes precedence, so A' is counted as the only candidate in all of these cases. This means that $P = [0, 4]$.
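The leftmost-tie rule in this example can be checked with a few lines of Python. This is an illustrative sketch only; the variable names are assumptions, and it relies on the documented behavior of `min`, which returns the first minimal item when several are equal.

```python
s = "AGTCAACG"
k, l = 4, 1

picks = []
for start in range(len(s) - k + 1):  # one pick per 4-mer
    window = range(start, start + k - l + 1)
    # Scanning left to right, min() keeps the first minimal substring,
    # so A' (position 4) wins over A* (position 5).
    picks.append(min(window, key=lambda j: s[j:j + l]))

print(picks)               # [0, 4, 4, 4, 4]
print(sorted(set(picks)))  # [0, 4]
```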

SCORING:

Inputs 2-4:
Inputs 5-7:
Inputs 8-16: No additional constraints.
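Enumerating every pair and every $K$-mer directly becomes slow as $N$ grows. One possible speed-up, sketched here as an assumption rather than the official solution, fixes $L$ and asks for each start position how wide a window it can win as the leftmost minimum. A monotonic stack finds the nearest earlier substring that is less than or equal and the nearest later substring that is strictly smaller; the gap between them bounds the window width. The name `count_by_set_size_fast` is hypothetical, and substrings are compared by slicing for simplicity, which a tuned solution would likely replace with precomputed ranks.

```python
def count_by_set_size_fast(s: str) -> list[int]:
    """Entry v-1 of the result is the number of (K, L) pairs with |P| = v."""
    n = len(s)
    answer = [0] * (n + 1)
    for l in range(1, n + 1):
        m = n - l + 1  # number of length-l substrings
        subs = [s[i:i + l] for i in range(m)]

        # left[i]: nearest j < i with subs[j] <= subs[i] (such a j beats i on ties).
        left, stack = [-1] * m, []
        for i in range(m):
            while stack and subs[stack[-1]] > subs[i]:
                stack.pop()
            left[i] = stack[-1] if stack else -1
            stack.append(i)

        # right[i]: nearest j > i with subs[j] < subs[i].
        right, stack = [m] * m, []
        for i in range(m - 1, -1, -1):
            while stack and subs[stack[-1]] >= subs[i]:
                stack.pop()
            right[i] = stack[-1] if stack else m
            stack.append(i)

        # Position i is chosen for every window of w consecutive starts with
        # w <= right[i] - left[i] - 1; a window of w starts means K = l + w - 1.
        reach = [0] * (m + 1)
        for i in range(m):
            reach[right[i] - left[i] - 1] += 1

        alive = 0  # number of positions whose reach is at least w, i.e. |P|
        for w in range(m, 0, -1):
            alive += reach[w]
            answer[alive] += 1
    return answer[1:]


print(count_by_set_size_fast("AGTCAACG"))  # [11, 10, 5, 4, 2, 2, 1, 1]
```

The third entry is 5, matching the sample explanation above.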

USACO
