在2024 Clojure状态调查！分享您的想法。

Question

`doseq`使用多个绑定导致“ClassFormatError: 无效的方法代码长度”

提问 2014年1月10日在Clojure 由 jira

重要性能提示新的实现对于自定义reducible但非分块的集合更快，对于绑定数量也更快。原始实现针对分块集合进行了手工调优，在分块集合较大且绑定数量较少的情况下具有优势，这可能是由于reduce的fn调用/返回跟踪开销造成的。详细信息请参考注释。
筛选者
补丁 doseq.patch

`
user=> (def a1 (range 10))

'user/a1

user=> (doseq [x1 a1 x2 a1 x3 a1 x4 a1 x5 a1 x6 a1 x7 a1 x8 a1] (do))
CompilerException java.lang.ClassFormatError: 类文件user$eval1032中的方法代码长度无效 69883 编译：（NO_SOURCE_PATH:2:1）
`

虽然这个例子很愚蠢，但我们多次遇到过这个问题。当你的代码只有几行时，突然出现代码长度错误，这会很令人惊讶。

15 答案

jira · Answer 1 · 2014-04-18T06:20:00+0000

评论者：hiredman

在jdk 1.8.0和clojure 1.6可复制

jira · Answer 2 · 2014-04-22T23:35:51+0000

评论者：bronsa

针对此问题的潜在修复方法是在dosenq中生成中间函数，就像for那样直接展开所有代码。

jira · Answer 3 · 2014-06-26T02:39:27+0000

评论者：gshayban

Existing doseq handles chunked-traversal internally, deciding the
mechanics of traversal for a seq.  In addition to possibly conflating
concerns, this is causing a code explosion blowup when more bindings are
added, approx 240 bytes of bytecode per binding (without modifiers).

This approach redefs doseq later in core.clj, after protocol-based
reduce (and other modern conveniences like destructuring.)

It supports the existing :let, :while, and :when modifiers.

New is a stronger assertion that modifiers cannot come before binding
expressions.  (Same semantics as let, i.e. left to right)

valid:  (link: x coll :when (foo x))
invalid: (link: :when (foo x) x coll)

This implementation does not suffer from the code explosion problem.
About 25 bytes of bytecode + 1 fn per binding.

Implementing this without destructuring was not a party, luckily reduce
is defined later in core.

jira · Answer 4 · 2014-06-26T06:25:21+0000

评论由：jafingerhut 提出

对于任何审核此补丁的人，请注意，在文件 test/clojure/test_clojure/for.clj 中已有许多针对 doseq 正确功能的测试。这可能并不明显，但每个使用 deftest-both 定义的 'for' 测试，既是 'for' 测试，也是 'doseq' 测试。

关于 doseq 当前的实现：问题不仅仅在于每次绑定都会使字节数过多，还在于代码大小随着每次额外的绑定而逐倍增加。请看这些结果，这些结果是对宏展开形式的尺寸的度量，而不是字节码大小，但这两者在这里应该有比较线性的关系

`
(defn formsize [form]
(count (with-out-str (print (macroexpand form)))))

user=> (formsize '(doseq [x (range 10)] (print x)))
652
user=> (formsize '(doseq [x (range 10) y (range 10)] (print x y)))
1960
user=> (formsize '(doseq [x (range 10) y (range 10) z (range 10)] (print x y z)))
4584
user=> (formsize '(doseq [x (range 10) y (range 10) z (range 10) w (range 10)] (print x y z w)))
9947
user=> (formsize '(doseq [x (range 10) y (range 10) z (range 10) w (range 10) p (range 10)] (print x y z w p)))
20997
`

以下是 2014 年 6 月 25 日 Ghadi 补丁 doseq.patch 之后的相同表达式的结果

user=> (formsize '(doseq [x (range 10)] (print x))) 93 user=> (formsize '(doseq [x (range 10) y (range 10)] (print x y))) 170 user=> (formsize '(doseq [x (range 10) y (range 10) z (range 10)] (print x y z))) 247 user=> (formsize '(doseq [x (range 10) y (range 10) z (range 10) w (range 10)] (print x y z w))) 324 user=> (formsize '(doseq [x (range 10) y (range 10) z (range 10) w (range 10) p (range 10)] (print x y z w p))) 401

同样希望看到有此补丁和无此补丁的性能结果。

jira · Answer 5 · 2014-06-28T20:21:55+0000

评论由：stu 提出

以下测试中，新实现在 doseq2 中，而原始实现在 doseq 中

`
(def hund (into [] (range 100)))
(def ten (into [] (range 10)))
(def arr (int-array 100))
(def s "superduper")

;; 大序列，绑定数较少：doseq2 输赢
(dotimes [_ 5]
(time (doseq [a (range 100000000)])))
;; 1.2 秒

(dotimes [_ 5]
(time (doseq2 [a (range 100000000)])))
;; 1.8 秒

;; 小型未分块的可归约序列，绑定数较少：doseq2 获胜
(dotimes [_ 5]
(time (doseq [a s b s c s])))
;; 0.5 秒

(dotimes [_ 5]
(time (doseq2 [a s b s c s])))
;; 0.2 秒

(dotimes [_ 5]
(time (doseq [a arr b arr c arr])))
;; 40 毫秒

(dotimes [_ 5]
(time (doseq2 [a arr b arr c arr])))
;; 8 毫秒

;; 小型分块可归约序列，绑定数较少：doseq2 输赢
(dotimes [_ 5]
(time (doseq [a hund b hund c hund])))
;; 2 毫秒

(dotimes [_ 5]
(time (doseq2 [a hund b hund c hund])))
;; 8 毫秒

;; 绑定数增多：doseq2 的优势越来越大
(dotimes [_ 5]
(time (doseq [a ten b ten c ten d ten ])))
;; 2 毫秒

(dotimes [_ 5]
(time (doseq2 [a ten b ten c ten d ten ])))
;; 0.4 毫秒

(dotimes [_ 5]
(time (doseq [a ten b ten c ten d ten e ten])))
;; 18 毫秒

(dotimes [_ 5]
(时间 (doseq2 [a 十 b 十 c 十 d 十 e 十]))
;; 1 毫秒
`

jira · Answer 6 · 2014-06-29T00:23:42+0000

评论者：gshayban

嗯，我无法重现你的结果。

你是在使用 lein 进行测试吗？在什么平台上，什么 JVM 选项？

我们可以用这个小装备直接针对 clojure.jar 进行测试，而不是直接测试？我已经附上了装备及其两次运行结果（一个是默认堆大小，另一个是 3GB G1GC 堆大小）

我还添加了中等和小的（范围）。

据我所知，doseq2 在所有情况下都优于 doseq2，除了小的范围。使用 criterium 显示 doseq2 的性能差距更大。

我已经将结果并排粘贴，以便于查看。

`
core/doseq doseq2
"耗时：1610.865146 毫秒" "耗时：2315.427573 毫秒"
"耗时：2561.079069 毫秒" "耗时：2232.479584 毫秒"
"耗时：2446.674237 毫秒" "耗时：2234.556301 毫秒"
"耗时：2443.129809 毫秒" "耗时：2224.302855 毫秒"
"耗时：2456.406103 毫秒" "耗时：2210.383112 毫秒"

;; 中等范围，少绑定
core/doseq doseq2
"耗时：28.383197 毫秒" "耗时：31.676448 毫秒"
"耗时：13.908323 毫秒" "耗时：11.136818 毫秒"
"耗时：18.956345 毫秒" "耗时：11.137122 毫秒"
"耗时：12.367901 毫秒" "耗时：11.049121 毫秒"
"耗时：13.449006 毫秒" "耗时：11.141385 毫秒"

;; 小范围，少绑定
core/doseq doseq2
"耗时：0.386334 毫秒" "耗时：0.372388 毫秒"
"耗时：0.10521 毫秒" "耗时：0.203328 毫秒"
"耗时：0.083378 毫秒" "耗时：0.179116 毫秒"
"耗时：0.097281 毫秒" "耗时：0.150563 毫秒"
"耗时：0.095649 毫秒" "耗时：0.167609 毫秒"

;; 小无块可缩减，少绑定
core/doseq doseq2
"耗时：2.351466 毫秒" "耗时：2.749858 毫秒"
"耗时：0.755616 毫秒" "耗时：0.80578 毫秒"
"耗时：0.664072 毫秒" "耗时：0.661074 毫秒"
"耗时：0.549186 毫秒" "耗时：0.712239 毫秒"
"耗时：0.551442 毫秒" "耗时：0.518207 毫秒"

core/doseq doseq2
"耗时：95.237101 毫秒" "耗时：55.3067 毫秒"
"耗时：41.030972 毫秒" "耗时：30.817747 毫秒"
"耗时：42.107288 毫秒" "耗时：19.535747 毫秒"
"耗时：41.088291 毫秒" "耗时：4.099174 毫秒"
"耗时：41.03616 毫秒" "耗时：4.084832 毫秒"

;; 小块可缩减，少绑定
core/doseq doseq2
"耗时：31.793603 毫秒" "耗时：40.082492 毫秒"
"耗时：17.302798 毫秒" "耗时：28.286991 毫秒"
"耗时：17.212189 毫秒" "耗时：14.897374 毫秒"
"耗时：17.266534 毫秒" "耗时：10.248547 毫秒"
"耗时：17.227381 毫秒" "耗时：10.022326 毫秒"

;; 更多绑定
core/doseq doseq2
"耗时：4.418727 毫秒" "耗时：2.685198 毫秒"
"耗时：2.421063 毫秒" "耗时：2.384134 毫秒"
"耗时：2.210393 毫秒" "耗时：2.341696 毫秒"
"耗时：2.450744 毫秒" "耗时：2.339638 毫秒"
"耗时：2.223919 毫秒" "耗时：2.372942 毫秒"

core/doseq doseq2
"耗时：28.869393 毫秒" "耗时：2.997713 毫秒"
"耗时：22.414038 毫秒" "耗时：1.807955 毫秒"
"耗时：21.913959 毫秒" "耗时：1.870567 毫秒"
"耗时：22.357315 毫秒" "耗时：1.904163 毫秒"
"耗时：21.138915 毫秒" "耗时：1.694175 毫秒"
`

jira · Answer 7 · 2014-06-29T00:47:56+0000

评论者：gshayban

基准测试中包含空的 doseq 主体是为了隔离遍历的开销，这是好的。然而，这仅代表了实际现实代码的 0%。

至少对于第一个基准（大块分片 seq）来说，添加少量的工作并没有显著改变结果。对于 (map str (link: a)) 也是如此

`
(range 10000000) => (map str [a])
core/doseq
"耗时：586.822389 毫秒"
"耗时：563.640203 毫秒"
"耗时：369.922975 毫秒"
"耗时：366.164601 毫秒"
"耗时：373.27327 毫秒"
doseq2
"耗时：419.704021 毫秒"
"耗时：371.065783 毫秒"
"耗时：358.779231 毫秒"
"耗时：363.874448 毫秒"
"耗时：368.059586 毫秒"

`

对于像 (inc a) 这样的内建函数也不是

`

(range 10000000)
core/doseq
"耗时：317.091849 毫秒"
"耗时：272.360988 毫秒"
"耗时：215.501737 毫秒"
"耗时：206.639181 毫秒"
"耗时：206.883343 毫秒"
doseq2
"耗时：241.475974 毫秒"
"耗时：193.154832 毫秒"
"耗时：198.757873 毫秒"
"耗时：197.803042 毫秒"
"耗时：200.603786 毫秒"
`

我还是看到基于 reduce 的 doseq 在原始版本之前，除了对于小的 seqs

jira · Answer 8 · 2014-08-04T20:55:01+0000

评论者：gshayban

以下形式的代码不会与这个补丁一起工作

(go (doseq [c chs] (>! c :foo)))

因为 go 宏不会遍历fn边界。我所知的唯一这种代码是 core.async/mapcat**，这是一个支持已标记为弃用的fn的私有fn。

jira · Answer 9 · 2014-08-07T20:09:53+0000

评论者：gshayban

我发现刚刚添加了 '#clojure.core/run!'，它也有类似的限制

jira · Answer 10 · 2014-08-29T14:19:39+0000

评论者：richhickey

请考虑 Ghadi 的反馈，特别是关于闭包的方面。

jira · Answer 11 · 2014-09-22T22:36:55+0000

评论者：gshayban

在go形式下，由于控制流程的数量，当前dosesq的扩展并不理想，与使用循环/递归相比，状态机的状态数从7增加到14。

(link: 1) 比较(go ... doseq)与(go ... loop/recur)的宏扩展
https://gist.github.com/ghadishayban/639009900ce1933256a1

jira · Answer 12 · 2015-09-03T18:59:58+0000

评论者：bronsa

对于这个问题的一般解决方案是，使用类似https://bitbucket.org/sperber/asm-method-size的东西在方法太大时自动拆分方法。

jira · Answer 13 · 2016-04-24T06:25:31+0000

评论者：gshayban

示例dosesq实现和宏扩展，不会出现指数级的字节码增长。它也不使用任何lambda，因此适合core.async。
https://gist.github.com/ghadishayban/fe84eb8db78f23be68e20addf467c4d4
它使用显式栈来存放seqs/bindings。
它尚未处理任何分块或修饰符。

jira · Answer 14 · 2019-06-26T12:00:00+0000

参考：https://clojure.atlassian.net/browse/CLJ-1322 (由arcatan报告)

在2024 Clojure状态调查！分享您的想法。

`doseq`使用多个绑定导致“ClassFormatError: 无效的方法代码长度”

'user/a1

请登录或注册以添加评论。

请登录或注册以回答这个问题。

15 答案

请登录或注册以添加评论。

请登录或注册以添加评论。

请登录或注册以添加评论。

请登录或注册以添加评论。

请登录或注册以添加评论。

请登录或注册以添加评论。

请登录或注册以添加评论。

请登录或注册以添加评论。

请登录或注册以添加评论。

请登录或注册以添加评论。

请登录或注册以添加评论。

请登录或注册以添加评论。

请登录或注册以添加评论。

请登录或注册以添加评论。

请登录或注册以添加评论。

类别

在2024 Clojure状态调查！分享您的想法。

doseq使用多个绑定导致“ClassFormatError: 无效的方法代码长度”

'user/a1

请登录或注册以添加评论。

请登录或注册以回答这个问题。

15 答案

请登录或注册以添加评论。

请登录或注册以添加评论。

请登录或注册以添加评论。

请登录或注册以添加评论。

请登录或注册以添加评论。

请登录或注册以添加评论。

请登录或注册以添加评论。

请登录或注册以添加评论。

请登录或注册以添加评论。

请登录或注册以添加评论。

请登录或注册以添加评论。

请登录或注册以添加评论。

请登录或注册以添加评论。

请登录或注册以添加评论。

请登录或注册以添加评论。

相关问题

类别

`doseq`使用多个绑定导致“ClassFormatError: 无效的方法代码长度”