久久精品国产精品久久久,无码少妇精品一区二区免费动态

論壇徽章:: 0

電梯直達(dá)

1樓 [收藏(0)] [報(bào)告]

發(fā)表于 2013-04-27 13:51 |只看該作者 |倒序?yàn)g覽

my $str = 'if a then b do f end end'

=>
{
0 => 'if a then b do f end end',
1 => 'do f end',
}

注意這里面不只是只有 if ... end 的結(jié)構(gòu)，還有 do .. end 的結(jié)構(gòu)，這些結(jié)構(gòu)都是可嵌套的。

如果你的算法是優(yōu)秀的，那么試試能不能擴(kuò)展一下，可以計(jì)算多種結(jié)構(gòu)的嵌套。

我發(fā)布的算法，已經(jīng)有解決方案了，只是想把這些有用的算法需求，分享給大家，娛樂一下：

前面發(fā)布的一個(gè)算法需求很簡單，但得到的代碼已經(jīng)非常復(fù)雜了。實(shí)際的需求更加復(fù)雜。

一個(gè)最簡單的語言中，可嵌套的結(jié)構(gòu)除了循環(huán)，判斷語句，還有數(shù)據(jù)結(jié)構(gòu)，函數(shù)定義。這些結(jié)構(gòu)要是一起來的話，您的算法要怎么擴(kuò)展呢？

也許我的算法讓大家很受打擊，但真實(shí)的需求就是這樣的。

文庫|博客

使用正則表達(dá)式與lex實(shí)現(xiàn)詞法分析器
C語言的MIPS匯編實(shí)現(xiàn)（四）SWITCH
Requested init /linuxrc failed (error -2).
比較 csv 文件中數(shù)據(jù)差異
LMD ElPack v2019.7新版亮點(diǎn)：Transparent mode全新升級(jí)|附下載

Perlvim

小富即安

論壇徽章:: 0

2樓 [報(bào)告]

發(fā)表于 2013-05-02 23:36 |只看該作者

本帖最后由 Perlvim 于 2013-05-03 18:26 編輯

String::NestMatch 模塊的 nest_match 方法可以處理：

它接受一個(gè)首尾字符串散列作為參數(shù)，不但可以處理單字符為標(biāo)志的結(jié)構(gòu)，也能處理多字符為標(biāo)志的結(jié)構(gòu)，能夠處理任意深度。使用了最基本的字符串替換算法，可以在 sed, awk, Lua, 等算法簡單的語言中實(shí)現(xiàn)。

測(cè)試代碼：

#!perl
use 5.014;
use YAML qw(Dump);
use String::NestMatch qw(nest_match);
my $str = 'if a then if b then if c then d end end end if f then g end';
say(Dump(nest_match($str, { if => 'end'})));
my $text = "<table><tr><td>aaa</td></tr></table>";
say(Dump(nest_match($text, { '<tr>' => '</tr>', '<td>' => '</td>' })));
my $str1 = 'if a then for b in c end end';
say(Dump(nest_match($str1, { 'if' => 'end', 'for' => 'end' })));

復(fù)制代碼

輸出：

>perl -w test_nest_match.pl
---
1:
- if a then if b then if c then d end end end
- if f then g end
2:
- if b then if c then d end end
3:
- if c then d end
---
1:
- '<tr><td>aaa</td></tr>'
2:
- '<td>aaa</td>'
---
1:
- if a then for b in c end end
2:
- for b in c end
>Exit code: 0 Time: 0.549

復(fù)制代碼

模塊源代碼：

package String::NestMatch;
use Exporter;
our @ISA = qw(Exporter);
our @EXPORT_OK = qw(nest_match);
use 5.010;
use strict;
use warnings;
use YAML qw(Dump);
my $count = 127;
my $id_char = {};
my $char_id = {};
sub apply_id_char {
my $id = shift;
$count++;
my $char = chr($count);
$id_char->{$id} = $char;
$char_id->{$char} = $id;
return $char;
}
sub char_id_in_text {
my ($text, @id) = @_;
foreach my $id (@id) {
if (length($id) == 1) {
$char_id->{$id} = $id;
$id_char->{$id} = $id;
next;
}
next if (exists $id_char->{$id});
my $char = apply_id_char($id);
if ($id =~ /^\w+$/) {
$text =~ s/\b$id\b/$char/g;
}
elsif ($id =~ /\w$/) {
$text =~ s/\Q$id\E\b/$char/g;
}
elsif ($id =~ /^\w/) {
$text =~ s/\b\Q$id\E/$char/g;
}
else {
$text =~ s/\Q$id\E/$char/g;
}
}
return $text;
}
sub nest_match {
my ($str, $rule) = @_;
my $match_start = {};
my $start_end_id = {};
while (my ($start_str, $end_str) = each %$rule) {
$str = char_id_in_text($str, $start_str, $end_str);
# say $str;
my $start_char = $id_char->{$start_str};
my $end_char = $id_char->{$end_str};
if (exists $match_start->{$start_char}) {
$match_start->{$start_char}{$end_char} = 1;
}
else {
$match_start->{$start_char} = { $end_char => 1 };
}
}
# default depth
my $depth = 0;
my $depth_chars = { 0 => [] };
# according depth to save matched string
my $depth_match_str = { };
my $depth_start_char = { 0 => '' };
my $expect_end_chars = {};
my @text_chars = split //, $str;
foreach my $char (@text_chars) {
if (exists $expect_end_chars->{$char}) {
push @{$depth_chars->{$depth}}, $char_id->{$char};
my $depth_str = join '', @{ $depth_chars->{$depth} };
if (exists $depth_match_str->{$depth}) {
push @{$depth_match_str->{$depth}}, $depth_str;
}
else {
$depth_match_str->{$depth} = [ $depth_str ];
}
$depth = $depth - 1;
push @{ $depth_chars->{$depth} }, $depth_str;
my $current_start_char = $depth_start_char->{$depth};
if ($depth == 0) {
$expect_end_chars = {};
} else {
$expect_end_chars = $match_start->{$current_start_char};
}
if (exists $match_start->{$char}) {
$depth = $depth + 1;
$depth_chars->{$depth} = [ $char_id->{$char} ];
$expect_end_chars = $match_start->{$char};
$depth_start_char->{$depth} = $char;
}
}
else {
if (exists $match_start->{$char}) {
$depth = $depth + 1;
$depth_chars->{$depth} = [ $char_id->{$char} ];
$expect_end_chars = $match_start->{$char};
$depth_start_char->{$depth} = $char;
}
else {
push @{ $depth_chars->{$depth} }, $char;
}
}
}
$count = 127; $id_char = {}; $char_id = {};
return $depth_match_str;
}
1;

復(fù)制代碼

實(shí)戰(zhàn)分享：從技術(shù)角度談機(jī)器學(xué)習(xí)入門| 【大話IT】RadonDB低門檻向MySQL集群下戰(zhàn)書 | ChinaUnix打賞功能已上線！ | 新一代分布式關(guān)系型數(shù)據(jù)庫RadonDB知多少？

rubyish

大富大貴

論壇徽章:: 7

15-16賽季CBA聯(lián)賽之青島
日期:2016-03-17 20:36:13

3樓 [報(bào)告]

發(fā)表于 2013-05-03 08:46 |只看該作者

3Q~這種算法很好

實(shí)戰(zhàn)分享：從技術(shù)角度談機(jī)器學(xué)習(xí)入門| 【大話IT】RadonDB低門檻向MySQL集群下戰(zhàn)書 | ChinaUnix打賞功能已上線！ | 新一代分布式關(guān)系型數(shù)據(jù)庫RadonDB知多少？

routesf

白手起家

論壇徽章:: 0

4樓 [報(bào)告]

發(fā)表于 2013-05-03 10:40 |只看該作者

回復(fù) 2# Perlvim

這是哪里的module,CPAN上沒查到，perlvim自己寫的?

實(shí)戰(zhàn)分享：從技術(shù)角度談機(jī)器學(xué)習(xí)入門| 【大話IT】RadonDB低門檻向MySQL集群下戰(zhàn)書 | ChinaUnix打賞功能已上線！ | 新一代分布式關(guān)系型數(shù)據(jù)庫RadonDB知多少？

routesf

白手起家

論壇徽章:: 0

5樓 [報(bào)告]

發(fā)表于 2013-05-03 14:52 |只看該作者

回復(fù) 4# routesf

我試了一下，括號(hào)似乎處理不了，類似這樣的
my $str3 = q(
interfaces {
fxp0 {
unit 0 {
      family inet {
      address 192.168.83.16/24;
      }
}
}
}
);
say(Dump(nest_match($str3, { '{' => '}' })));

實(shí)戰(zhàn)分享：從技術(shù)角度談機(jī)器學(xué)習(xí)入門| 【大話IT】RadonDB低門檻向MySQL集群下戰(zhàn)書 | ChinaUnix打賞功能已上線！ | 新一代分布式關(guān)系型數(shù)據(jù)庫RadonDB知多少？

Perlvim

小富即安

論壇徽章:: 0

6樓 [報(bào)告]

發(fā)表于 2013-05-03 18:13 |只看該作者

是的，代碼算法有問題，已經(jīng)更正。在第29行到30行之間增加一行：
$id_char->{$id} = $id;

實(shí)戰(zhàn)分享：從技術(shù)角度談機(jī)器學(xué)習(xí)入門| 【大話IT】RadonDB低門檻向MySQL集群下戰(zhàn)書 | ChinaUnix打賞功能已上線！ | 新一代分布式關(guān)系型數(shù)據(jù)庫RadonDB知多少？

followcn

白手起家

論壇徽章:: 0

7樓 [報(bào)告]

發(fā)表于 2013-05-06 11:46 |只看該作者

用調(diào)試的方式，好好學(xué)習(xí)了一下。
一個(gè)字符，一個(gè)字符的處理，這樣效率是否有問題？在兩個(gè)關(guān)鍵字之間的字符串能否一起處理呢。

實(shí)戰(zhàn)分享：從技術(shù)角度談機(jī)器學(xué)習(xí)入門| 【大話IT】RadonDB低門檻向MySQL集群下戰(zhàn)書 | ChinaUnix打賞功能已上線！ | 新一代分布式關(guān)系型數(shù)據(jù)庫RadonDB知多少？

Perlvim

小富即安

論壇徽章:: 0

8樓 [報(bào)告]

發(fā)表于 2013-05-06 12:55 |只看該作者

回復(fù) 7# followcn
所有字符串的算法，其實(shí)都是一個(gè)一個(gè)的處理。否則怎么知道到達(dá)了邊界？正則表達(dá)式看似很快，也是一個(gè)一個(gè)字符的處理。如果有分支和分組的話，一個(gè)字符要處理好幾次。所以分支的處理，分開后反而效率更高：

/branch1/ && /branch2/ => /branch1|branch2/

實(shí)戰(zhàn)分享：從技術(shù)角度談機(jī)器學(xué)習(xí)入門| 【大話IT】RadonDB低門檻向MySQL集群下戰(zhàn)書 | ChinaUnix打賞功能已上線！ | 新一代分布式關(guān)系型數(shù)據(jù)庫RadonDB知多少？

followcn

白手起家

論壇徽章:: 0

9樓 [報(bào)告]

發(fā)表于 2013-05-07 15:32 |只看該作者

本帖最后由 followcn 于 2013-05-07 15:34 編輯

可能是我的意思表達(dá)的不清楚，例如對(duì)下例中someting的處理
模塊中對(duì)something這樣的詞要循環(huán)9次，進(jìn)行判斷。
if somethting end

實(shí)戰(zhàn)分享：從技術(shù)角度談機(jī)器學(xué)習(xí)入門| 【大話IT】RadonDB低門檻向MySQL集群下戰(zhàn)書 | ChinaUnix打賞功能已上線！ | 新一代分布式關(guān)系型數(shù)據(jù)庫RadonDB知多少？

Perlvim

小富即安

論壇徽章:: 0

10樓 [報(bào)告]

發(fā)表于 2013-05-07 18:37 |只看該作者

如果想把類似的單詞先替換成一個(gè)字節(jié)的字符，讓掃描的時(shí)候，只掃描一次，這個(gè)算法就快多了。
你提醒了我，可以在將關(guān)鍵字替換成單字節(jié)字符的同時(shí)，將其他的單詞一起替換成單字節(jié)字符。

這樣，就快多了

實(shí)戰(zhàn)分享：從技術(shù)角度談機(jī)器學(xué)習(xí)入門| 【大話IT】RadonDB低門檻向MySQL集群下戰(zhàn)書 | ChinaUnix打賞功能已上線！ | 新一代分布式關(guān)系型數(shù)據(jù)庫RadonDB知多少？

亚洲av成人无遮挡网站在线观看,少妇性bbb搡bbb爽爽爽,亚洲av日韩精品久久久久久,兔费看少妇性l交大片免费,无码少妇一区二区三区

[探討]字符串匹配算法擴(kuò)展算法 [復(fù)制鏈接]


平臺(tái) 論壇博客文庫