functionalRust 的性能影响是什么

发布于04月14日

我正在沿着铁 rust 的轨道走Exercism.io英里.我有相当多的C/C++经验.我喜欢Rust的"功能性"元素，但我担心它的相对性能.

pub fn encode(source: &str) -> String {
    let mut retval = String::new();
    let firstchar = source.chars().next();
    let mut currentchar = match firstchar {
        Some(x) => x,
        None => return retval,
    };
    let mut currentcharcount: u32 = 0;
    for c in source.chars() {
        if c == currentchar {
            currentcharcount += 1;
        } else {
            if currentcharcount > 1 {
                retval.push_str(&currentcharcount.to_string());
            }
            retval.push(currentchar);
            currentchar = c;
            currentcharcount = 1;
        }
    }
    if currentcharcount > 1 {
        retval.push_str(&currentcharcount.to_string());
    }
    retval.push(currentchar);
    retval
}

我注意到其中一个排名靠前的答案看起来更像这样:

extern crate itertools;

use itertools::Itertools;

pub fn encode(data: &str) -> String {
    data.chars()
        .group_by(|&c| c)
        .into_iter()
        .map(|(c, group)| match group.count() {
            1 => c.to_string(),
            n => format!("{}{}", n, c),
        })
        .collect()
}

我喜欢一流的解决方案；它简单、实用、优雅.这就是他们向我promise 的一切.另一方面，我的 idea 是粗俗的，充满了可变变量.你可以告诉我我已经习惯C++了.

我的问题是功能性风格对性能有很大影响.我用相同的4MB随机数据对这两个版本进行了1000次编码测试.我的紧急解决方案只用了不到10秒；功能性溶液约为2min 30秒.

为什么功能性风格比命令式风格慢得多？
功能实现是否存在导致如此巨大减速的问题？
如果我想写高性能代码，我应该使用这种函数式风格吗？

pub fn encode_slim(data: &str) -> String { data.chars() .batching(|it| { it.next() .map(|v| (v, it.take_while_ref(|&v2| v2 == v).count() + 1)) }) .format_with("", |(c, count), f| match count { 1 => f(&c), n => f(&format_args!("{}{}", n, c)), }) .to_string() }

encode (procedural) time: [21.082 ms 21.620 ms 22.211 ms] encode (fast) time: [26.457 ms 27.104 ms 27.882 ms] Found 7 outliers among 100 measurements (7.00%) 4 (4.00%) high mild 3 (3.00%) high severe

struct RunLength { iter: I, saved: Option<char>, } impl RunLength where I: Iterator<Item = char>, { fn new(mut iter: I) -> Self { let saved = iter.next(); // See footnote 1 Self { iter, saved } } } impl Iterator for RunLength where I: Iterator<Item = char>, { type Item = (char, usize); fn next(&mut self) -> Option<Self::Item> { let c = self.saved.take().or_else(|| self.iter.next())?; let mut count = 1; while let Some(n) = self.iter.next() { if n == c { count += 1 } else { self.saved = Some(n); break; } } Some((c, count)) } } pub fn encode_tiny(data: &str) -> String { use std::fmt::Write; RunLength::new(data.chars()).fold(String::new(), |mut s, (c, count)| { match count { 1 => s.push(c), n => write!(&mut s, "{}{}", n, c).unwrap(), } s }) }

encode (procedural) time: [19.888 ms 20.301 ms 20.794 ms] Found 4 outliers among 100 measurements (4.00%) 3 (3.00%) high mild 1 (1.00%) high severe encode (tiny) time: [19.150 ms 19.262 ms 19.399 ms] Found 11 outliers among 100 measurements (11.00%) 5 (5.00%) high mild 6 (6.00%) high severe

use criterion::{criterion_group, criterion_main, Criterion}; // 0.2.11 use rle::*; fn criterion_benchmark(c: &mut Criterion) { let data = rand_data(4 * 1024 * 1024); c.bench_function("encode (procedural)", { let data = data.clone(); move |b| b.iter(|| encode_proc(&data)) }); c.bench_function("encode (functional)", { let data = data.clone(); move |b| b.iter(|| encode_iter(&data)) }); c.bench_function("encode (fast)", { let data = data.clone(); move |b| b.iter(|| encode_slim(&data)) }); c.bench_function("encode (tiny)", { let data = data.clone(); move |b| b.iter(|| encode_tiny(&data)) }); } criterion_group!(benches, criterion_benchmark); criterion_main!(benches);

use itertools::Itertools; // 0.8.0 use rand; // 0.6.5 pub fn rand_data(len: usize) -> String { use rand::distributions::{Alphanumeric, Distribution}; let mut rng = rand::thread_rng(); Alphanumeric.sample_iter(&mut rng).take(len).collect() } pub fn encode_proc(source: &str) -> String { let mut retval = String::new(); let firstchar = source.chars().next(); let mut currentchar = match firstchar { Some(x) => x, None => return retval, }; let mut currentcharcount: u32 = 0; for c in source.chars() { if c == currentchar { currentcharcount += 1; } else { if currentcharcount > 1 { retval.push_str(&currentcharcount.to_string()); } retval.push(currentchar); currentchar = c; currentcharcount = 1; } } if currentcharcount > 1 { retval.push_str(&currentcharcount.to_string()); } retval.push(currentchar); retval } pub fn encode_iter(data: &str) -> String { data.chars() .group_by(|&c| c) .into_iter() .map(|(c, group)| match group.count() { 1 => c.to_string(), n => format!("{}{}", n, c), }) .collect() } pub fn encode_slim(data: &str) -> String { data.chars() .batching(|it| { it.next() .map(|v| (v, it.take_while_ref(|&v2| v2 == v).count() + 1)) }) .format_with("", |(c, count), f| match count { 1 => f(&c), n => f(&format_args!("{}{}", n, c)), }) .to_string() } struct RunLength { iter: I, saved: Option<char>, } impl RunLength where I: Iterator<Item = char>, { fn new(mut iter: I) -> Self { let saved = iter.next(); Self { iter, saved } } } impl Iterator for RunLength where I: Iterator<Item = char>, { type Item = (char, usize); fn next(&mut self) -> Option<Self::Item> { let c = self.saved.take().or_else(|| self.iter.next())?; let mut count = 1; while let Some(n) = self.iter.next() { if n == c { count += 1 } else { self.saved = Some(n); break; } } Some((c, count)) } } pub fn encode_tiny(data: &str) -> String { use std::fmt::Write; RunLength::new(data.chars()).fold(String::new(), |mut s, (c, count)| { match count { 1 => s.push(c), n => write!(&mut s, "{}{}", n, c).unwrap(), } s }) } #[cfg(test)] mod test { use super::*; #[test] fn all_the_same() { let data = rand_data(1024); let a = encode_proc(&data); let b = encode_iter(&data); let c = encode_slim(&data); let d = encode_tiny(&data); assert_eq!(a, b); assert_eq!(a, c); assert_eq!(a, d); } }

functionalRust 的性能影响是什么

推荐答案

支持代码

Rust相关问答推荐

为什么是！为Rust中的RwLockReadGuard和RwLockWriteGuard实现的发送特征？

Rust kill std：：processs：：child

如何使用字符串迭代器执行查找？

编译项目期间使用Cargo生成时出现rustc错误

如何导入crate-type=["；cdylib；]库？

铁 rust ，我的模块介绍突然遇到了一个问题

如何在函数中返回自定义字符串引用？

我如何使用AWS SDK for Rust获取我承担的角色的凭据？

为什么 tokio 在以奇怪的方式调用时只运行 n 个任务中的 n-1 个？

为什么 `Deref` 没有在 `Cell` 上实现？

将特征与具有生命周期的关联类型一起使用时的生命周期方差问题

如何在 `connect_activate()` 之外创建一个 `glib：：MainContext：：channel()` 并将其传入？

具有多个键的 HashMap

Rust 中指向自身的引用如何工作？

Rust 异步循环函数阻塞了其他future 任务的执行

为什么不可变特征的实现可以是可变的？

使用部分键从 Hashmap 中检索值

我如何将 google_gmail1：：Gmail> 传递给线程生成？

为什么这个值在上次使用后没有下降？

在 macro_rules 中转义 $ 美元符号